Abstract
Mosquito vectors of pathogens (e.g., Aedes, Anopheles, and Culex spp. which transmit dengue, Zika, chikungunya, West Nile, malaria, and others) are of increasing concern for global public health. These vectors are geographically shifting under climate and other anthropogenic changes. As small-bodied ectotherms, mosquitoes are strongly affected by temperature, which causes unimodal responses in mosquito life history traits (e.g., biting rate, adult mortality rate, mosquito development rate, and probability of egg-to-adult survival) that exhibit upper and lower thermal limits and intermediate thermal optima in laboratory studies. However, it remains unknown how mosquito thermal responses measured in laboratory experiments relate to the realized thermal responses of mosquitoes in the field. To address this gap, we leverage thousands of global mosquito occurrences and geospatial satellite data at high spatial resolution to construct machine-learning based species distribution models, from which vector thermal responses are estimated. We apply methods to restrict models to the relevant mosquito activity season and to conduct ecologically plausible spatial background sampling centered around ecoregions for comparison to mosquito occurrence records. We found that thermal minima estimated from laboratory studies were highly correlated with those from the species distributions (r = 0.87). The thermal optima were less strongly correlated (r = 0.69). For most species, we did not detect thermal maxima from their observed distributions so were unable to compare to laboratory-based estimates. The results suggest that laboratory studies have the potential to be highly transportable to predicting lower thermal limits and thermal optima of mosquitoes in the field. At the same time, lab-based models likely capture physiological limits on mosquito persistence at high temperatures that are not apparent from field-based observational studies but may critically determine mosquito responses to climate warming. Our results indicate that lab-based and field-based studies are highly complementary; performing the analyses in concert can help to more comprehensively understand vector response to climate change.
Author summary
Mosquito vectors are strongly affected by temperature, and their distributions are likely to shift under climate change. Lab studies show that mosquito abundance has a unimodal response to temperature with thermal optima, upper and lower thermal limits. However, it remains unknown how mosquito laboratory-derived thermal responses relate to the thermal responses of mosquitoes in nature. We used a global database of field-collected mosquito occurrences, geospatial environmental covariates, and species distribution models to estimate the relationship between temperature and probability of mosquito occurrence. We found that thermal minima (r = 0.87) and, to a lesser degree, thermal optima (r = 0.69) estimated from laboratory studies were correlated with those from the species distribution models. For most species, we did not detect thermal maxima. These results suggest that laboratory studies and field-based machine learning studies are complementary. Together, they can help to better understand vector response to climate change.
Introduction
Mosquito-borne diseases (e.g., malaria, dengue, Zika, chikungunya, West Nile, yellow fever) are responsible for a significant worldwide burden of infectious disease and represent a major threat to global public health [1–5]. As small-bodied ectotherms, mosquito vectors are sensitive to environmental conditions and, in particular, to abiotic factors such as temperature [6,7]. Climate change is likely to alter the climatic and habitat suitability for and the geographic distribution of mosquitoes, in turn affecting the distribution of pathogens they transmit [8–11]. Understanding the limitations that temperature and other ecological conditions place on mosquito vectors is critically important for predicting how mosquitoes and vector-borne diseases will respond to climate change.
Previous laboratory experiments have measured the relationship between temperature and mosquito life history traits such as biting rate, adult mortality rate, mosquito development rate, and probability of egg-to-adult survival for many vector species [12–15]. Together, these trait thermal performance relationships can provide a mechanistic estimate of equilibrium mosquito abundance as a function of temperature [6,10,16]. For mosquitoes for which these relationships have been estimated, the thermal response curves of individual traits follow predictable patterns: they decline to zero at lower thermal minima and upper thermal maxima and peak at intermediate thermal optima, as expected from first principles of physiology and enzyme kinetics [16–18]. In aggregate, modeled population-level mosquito abundance has a similar unimodal relationship with temperature [6], reflecting the underlying life history traits [10]. These studies confirm a core tenet of the metabolic theory of ecology, which states that organismal physiology operates within and is restricted by thermal limits [19]. Temperature-dependent effects of these life history traits in turn affect transmission of mosquito-borne disease [6,20–33]. Despite these clear predictions from ectotherm physiology and laboratory thermal performance experiments, it remains unknown to what extent these experimental, laboratory-based mosquito thermal responses predict the realized thermal responses of mosquito populations in the field. In particular: (1) is temperature an important predictor of mosquito distributions? If so, (2) are effects of temperature on mosquito occurrence nonlinear? And if so, (3) do the predicted thermal optima and limits from laboratory experiments align with thermal optima and limits measured in the field? Altogether, our study aims to identify how the gold-standard of measuring mosquito thermal tolerance (lab-based thermal performance curves) and advancing artificial intelligence efforts can be combined to identify how mosquitos will respond to global change within their natural environment.
The rapid rise of biodiversity data, machine learning, and open source satellite imagery in the past decade has enabled new types of approaches necessary to address these questions. Species distribution models offer a way to connect data on species occurrences with environmental covariates to understand how environmental conditions influence the probability of a species occurring in a given location (i.e., habitat suitability). Due to the challenges of accurately detecting a species’ absence, species distribution modeling typically compares locations in which a species has previously been identified (occurrences) and an artificially-generated set of background points (pseudo-absences) which approximate the potentially-habitable and accessible area for a given species [34–37]. Species distribution models have been used to characterize the ecological niches (i.e., habitat and climatic requirements) of organisms across the tree of life, including elephants in South Asia [38], lynxes in Canada [39], ants in Australia [40], and rare plants in California [41]. Beyond predicting geographic ranges, these models can indicate which environmental covariates are most important for predicting species occurrences, and the functional relationships between environmental conditions and species distributions.
Species distribution models often rely on remotely sensed satellite imagery to provide information about biotic, abiotic, and anthropogenic variables. Recent advances in storage and processing of remotely-sensed imagery, including publicly-available platforms like Google Earth Engine [42], have increased the resolution and spatial extent over which these models can feasibly be run. In parallel, the Global Biodiversity Information Facility (GBIF) leverages crowdsourced data collection and digitized biodiversity data, where occurrence points of many species are compiled within a central repository [42]. Moreover, new algorithms and advances in machine learning have improved the predictive capacity of species distribution models, and have facilitated the development of more complex models with interactive and nonlinear relationships between predictors and responses that can potentially capture ecological complexity. Together, these innovations provide ripe new avenues and abundant data through which to tackle ecological questions.
Despite the promising advances in species distribution modeling, major limitations also remain. First and most importantly, species distribution models are correlative and not causal, so the factors that best capture distributional limits (i.e., discriminate between presences and pseudo-absence) may not fully capture the true physiological and ecological limits. This may occur because biologically limiting factors are not easily measured or highly variable across distributional ranges, because may have high-order and non-linear correlations with other factors, or because organisms have not yet encountered their limits in a given ecological factor due to constraints posed by other factors. For example, a temperature limit may not be easily identifiable from species distribution models because a species is constrained to a habitat type where such limiting temperatures do not currently occur. Alternatively, species that are actively expanding their range may not have yet expanded to occupy the entire available suitable habitat. This means that species distribution models have limited capacity to extrapolate future changes in ecological constraints on a changing world, without complementary knowledge from mechanistic approaches such as experiments and mechanistic models. In this work, we build on the complementary strengths of two distinct approaches—the biological mechanism and interpretability of simple mathematical models based on controlled laboratory experiments, and the complexity and flexibility of species distribution models that capture interactive and nonlinear environmental relationships—to ask whether temperature, a fundamental biological constraint, has similar inferred effects on the distributional limits of key mosquito vector species.
We focus on the vectors of the world’s highest-burden mosquito-borne diseases (malaria, dengue, chikungunya, Zika, West Nile, and other arboviruses)—Aedes aegypti, Aedes albopictus, Anopheles gambiae, Anopheles stephensi, Culex pipiens, Culex quinquefasciatus, and Culex tarsalis—each of which has well-characterized thermal performances curves in laboratory settings and a wealth of data on occurrences in the field [6,9,16,25,28,43,44]. Ae. aegypti and Ae. albopictus are important vectors of dengue, chikungunya, yellow fever, and Zika viruses. An. gambiae and An. stephensi primarily vector the Plasmodium spp. protozoans that cause malaria. Cx. pipiens, Cx. quinquefasciatus, and Cx. tarsalis transmit West Nile, St. Louis encephalitis, Western Equine encephalitis, and other zoonotic viruses, and Cx. quinquefasciatus additionally transmits lymphatic filariasis.
While previous mosquito species distribution models have focused on estimating habitat suitability and predicting occurrence [8,9,44], our goal is different: to specifically dissect the relationship between temperature and probability of mosquito occurrence, while incorporating other constraints on occurrence, and to compare this relationship among mosquito species and with laboratory-based model predictions. For this reason we created new models, rather than using published ones, that use consistent methods across multiple mosquito species. Further, we aimed to construct species distribution models for each focal species using spatially- and temporally-similar data sources and consistent model assumptions in order to compare thermal dependence across species.
We make two important methodological advances to ensure that our species distribution models capture true thermal limits on mosquito occurrence, rather than sampling bias or restricted geographic ranges that covary with temperature. First, we sample background pseudo-absence points only from the set of ecoregions in which the focal mosquitoes occurred, as well as adjacent ecoregions. This ensures that comparator pseudo-absence points are selected from regions where the focal mosquito species could have realistically been found but were not, or in other words, zones that are ecologically plausible for an occurrence. Second, we restrict temperature measurements to an ‘activity season’ during which each mosquito species is blood-feeding and reproducing, and not in dormancy or torpor. This captures temperature constraints within a physiologically realistic period, rather than constraints due to overwintering or drought persistence, which are less likely to be comparable to laboratory-based trait thermal response experiments. By using global mosquito occurrences from 2000–2019, geospatial satellite-derived covariates at high spatial and temporal resolution, bias-reducing methods described above, and a gradient-boosted classification tree machine learning algorithm well-suited for prediction tasks, we aim to provide evidence that spans across continents and decades for the globally important vectors of human malaria and arboviral disease. By leveraging both approaches, we may be able to use their complementary strengths to minimize each of their limitations. With a greater understanding of thermal constraints on vector species occurrence, we can improve the ability to accurately project and mitigate the future impacts of climate change on mosquito-borne disease distributions.
Methods
Overview and study period
We focused on 2000–2019 to isolate recent land-use and climate patterns but provide a long enough time period to capture mosquitoes’ stable spatial distribution. The environmental covariates and mosquito occurrences were both extracted for this time period, providing temporal consistency in the analysis [45]. An overview of the methods can be seen below (Fig 1). Computation was performed in R v4.1.1, Google Earth Engine and the Sherlock computing cluster at Stanford University.
Environmental predictors
Environmental covariates were defined a priori in order to avoid data-dredging [11], and were based on a literature search of previous species distribution models. Variables were informed by ecological and biological relevance, expectation to play a role in mosquito vector ecology, and top contributor status in previous mosquito species distribution models [46–49]. Environmental covariates were computed using remotely sensed and reanalysis data, and resampled using bilinear interpolation to consistent 1 km x 1 km resolution on Google Earth Engine. This approach using global satellite data avoids the spatial interpolation limitations of common climatic datasets such as WorldClim, in which data are sourced from ground meteorological stations with uneven geographic distribution and nonrepresentative selection.
In order to verify minimal overlap between predictor variables, a pairwise correlation analysis was performed with a Pearson’s correlation coefficient r < |0.8| threshold (Fig A in S1 Text). Annual averages for covariates were computed across the study period, providing the characteristic climatic conditions across the study’s temporal range.
Temperature predictors
While many species are active year-round (Table 1), numerous species physiologically undergo photoperiodic diapause in the winter months when ambient light is low in order to conserve energy for life sustenance [50] or aestivate during the dry season when no breeding habitat is available [51]. To capture the temperature that mosquitoes experience while active, temperature covariates were calculated over the activity season.
Table 1. Mosquito species by activity season.
Mosquito Species | Activity Season (Reference) | Raw Number of Occurrences | Unique Cell Centroids |
---|---|---|---|
Aedes aegypti | Year Round [57,58] | 37,115 | 9,299 |
Aedes albopictus # | Photoperiod [52,59] | 39,488 | 8,688 |
Anopheles gambiae | Precipitation [51,55] | 13,650 | 903 |
Anopheles stephensi | Year Round [60] | 1,232 | 358 |
Culex pipiens | Photoperiod [53,54] | 98,276 | 1,949 |
Culex quinquefasciatus | Year Round [61,62] | 30,978 | 2,670 |
Culex tarsalis | Photoperiod [63] | 44,496 | 909 |
# This species enters winter diapause in some regions such as the Northeast US, but not other regions like the Southeast US.
For the purposes of this study, the photoperiodic activity season was defined as the time period encompassing days with at least 9 hours of sunlight (Fig B in S1 Text). This definition of the activity season was chosen based on a variable separate from temperature and on laboratory studies that found that either 8 or 9 hours of light can induce diapause in mosquitoes in the absence of temperature change [52–54]. The photoperiod-based constraint only limited the activity season in higher latitudes and did not affect lower latitudes (Fig B in S1 Text). Other tropical mosquito species (e.g., Anopheles gambiae) do not enter photoperiodic dormancy due to their equatorial setting. Rather, these species’ activity seasons are instead constrained by lack of precipitation, when insufficient habitat is available for immature development and survival. Studies have highlighted how Sahelian Anopheles gambiae mosquitoes virtually disappear in the dry season then re-establish in the wet season through aestivation behavior [51,55], and climatological studies in the Democratic Republic of the Congo characterize the dry season as having less than 50 mm of rain in the past month [56]. Precipitation activity season is therefore defined in our study as the time period in which there is, on average, at least 50 mm of precipitation in the past 30 days (Fig C in S1 Text).
Predictors capturing thermal central tendency include year round temperature mean, photoperiod activity season temperature mean, and precipitation activity season temperature mean. Measures of seasonal thermal variation include year round temperature standard deviation, photoperiod activity season temperature standard deviation, and precipitation activity season temperature standard deviation.
Other environmental predictors
Environmental variables other than temperature included measures of vegetation cover, such as enhanced vegetation index mean, enhanced vegetation index standard deviation, and forest cover percentage, to capture land cover that might affect mosquito habitat suitability (Table B, Fig D in S1 Text). To determine which environmental predictors to include, we first looked to the literature to understand which ecological variables were consistently utilized in past SDM models and which were found to be the most important predictors in other papers for our target mosquito species. Among those top predictors from previous papers, we selected covariates that could best capture the underlying vector ecology and biology.
For example, in this process, we identified that human population density is a variable that may be ecologically important as it represents the fact that many mosquito species are highly human-dependent with human blood-feeds, inhabit urban or peri-urban niches, and breed in artificial containers or pools of water in close proximity to human settlements. Indeed, human population density was the topmost predictor for in models for Ae. aegypti [48], and for Cx. pipiens [47]. Precipitation of the driest quarter, on the other hand, was thought to capture suitable mosquito breeding habitats in the setting of less standing water in the dry season, and has been found to be an important predictor for An. gambiae [46], for Ae. aegypti and Cx. pipiens [49]. Enhanced vegetation index describes the quality of vegetation features like leaf area, canopy cover, and sugar resources that may provide alternate food sources or resting sites for many species [47], and both mean and standard deviation of vegetation index have important predictors for Cx. pipiens in a previous model [47].
Ultimately, human population density and cattle density were included to capture sources of blood meal, anthropophilic hotspots, and human-seeking behavior. Precipitation of the driest quarter, precipitation of the wettest quarter, and surface water seasonality were included to capture hydrology-driven characteristics influencing mosquito breeding site availability or abundance, while wind speed was included to capture potential aerial dispersal limitation of mosquitoes and relative humidity was included because it affects survival and life history [64]. Precipitation of the driest quarter and precipitation of the wettest quarter were defined as the set of 3 consecutive months with the least and most precipitation, respectively, across the 20-year study period.
Occurrence and background
Presence occurrences
Focal mosquito species were selected based on global disease burden, availability of life history trait data from published laboratory studies, and abundance of publicly-accessible species occurrences, resulting in seven species: Ae. aegypti, Ae. albopictus, An. gambiae, An. stephensi, Cx. pipiens, Cx. quinquefasciatus, and Cx. tarsalis. Collectively, these seven species span nearly all habitable continents (Figs E-K in S1 Text). Species occurrence records for each species were obtained from the Global Biodiversity Information Facility (GBIF) for the years 2000–2019. Supplemental occurrence points were obtained from two papers for An. stephensi [65] and An. gambiae [66], which had low sample sizes from the GBIF database. Occurrences explicitly tagged in Africa were discarded from the supplemental An. stephensi data, as these are part of an ongoing species invasion outside of the native range of the species in South Asia [65], and species distributions models have limitations in detecting environmental suitability for invasive species as they do not yet occupy their environmental niche and are not yet in equilibrium with their environment [67].
Raw datasets were cleaned to remove occurrences with unknown basis of record and those obtained through fossil records, due to the temporal and geographic uncertainty between when and where the organism actually existed during its lifetime and its location of fossilization. Uncertainty in the latitude and longitude coordinates of occurrence points were taken into account through a two-step process. First, points with a coordinate uncertainty less than 1000 meters—the diameter of our environmental covariate grid cell size—and those points with missing coordinate uncertainty were selected. Of the points with missing coordinate uncertainty, we retain only those points with a latitude and a longitude with at least two significant decimal points (i.e., each hundredth corresponds to approximately 1.11 km). The occurrence data were then restricted to points that reside on landmasses and not in the ocean, those with non-missing values for all covariates, and, when relevant, and those that occurred in areas where the species’ activity season length was greater than 0 (i.e., when the precipitation was above a certain value in the grid cell, i.e., precipitation activity season, or when there was a certain amount of daylight hours in the grid cell, i.e., photoperiod activity season) (Figs B-C in S1 Text). We filtered the occurrences to the cell centroids of the unique 1 km x 1 km cells in which occurrence points were obtained, which serves to spatially thin the occurrence points and reduce the effect of biased sampling [68] (Table 1; Table A in S1 Text).
Pseudo-absence background sampling
Species distribution models work by comparing the environmental covariates that best predict locations with species occurrences to those in which the species does not occur, to understand the environmental conditions that constrain a species’ distribution. As the majority of information is on species occurrences rather than species absences, methods for species distribution modeling have developed to sample pseudo-absences: points that represent the area in which a species could have been sampled yet was not reported. In order for these sampled points to serve as plausible points where the species of interest was not found, they should be within the species’ accessible area, or the region that is reasonably reachable by the species through dispersion or migration over the relevant time period [34,36,37]. Choosing pseudo-absences that occur within the same ecoregions, i.e., geographic regions that possess similar species and community assemblages, as occurrences provides a way to geographically restrict possible background points. Moreover, these points should be chosen in a way that takes into account sampling bias and collection effort.
To sample background points, we first created a bias mask to probability-weight the sampling. 72 million occurrences from Class Insecta, filtered to the same criteria as mosquito occurrence points (i.e., <1000m coordinate uncertainty or at least two significant decimal points of latitude and longitude, 2000–2019 study period, no fossil record or literature points), were extracted from GBIF. In order to capture the frequency of GBIF sampling conducted in a specific geographic location, the number of insects were tallied for each 1 km x 1 km grid cell, and these counts were converted into proportions of all insects. This proportion was regarded as the probability weight for that given grid cell. To provide adequate sample size while avoiding a skew of the overall sample towards disproportionately favoring pseudo-absences over occurrences, we selected a number of unique background cell centroids equal to twice the number of unique occurrence centroids [69]. Background cells were sampled at random without replacement and weighted by the bias mask from the geographical region that was bounded by the focal species’ ecoregions of occurrence. In addition, we include adjacent ecoregions in the background sampling space to promote contiguity in the potential ecological spaces from which pseudo-absences can be drawn and to better encapsulate the broad range of environmental covariates that a given species may experience in nature. Ecoregions were defined as RESOLVE Ecoregions that intersected with a given species’ occurrence centroids, buffered with a 200 km radius buffer that approximated an upper end of the wind-assisted geographic mobility range of a mosquito [70,71]. By buffering occurrences and then choosing the set of ecoregions and adjacent ecoregions in which these buffered points lie, we approximate what is known in ecological theory as the ‘accessible area,’ described above [34,36,37]. Because our focal questions centered on species’ thermal breadth, we wanted to ensure that as broad a temperature range as was reasonable was included in the background sampling. Specifically, we checked whether our background sampling scheme provided a temperature distribution at least as wide as that of the occurrences—a consideration important to ensure that the statistical model can ascertain thermal minima or maxima.
Due to the unequal historically unequal sampling effort for Insecta species in GBIF, the distribution of occurrence and background points was weighted towards North America and Europe particularly for Ae. aegypti and Ae. albopictus (Figs E-F in S1 Text). If there are regional differences in the thermal minima and optima between continents, the estimates will be overweighted towards North America and Europe. To address this potential bias, we additionally fit species distribution models for these two species dropping the occurrence and background points for North America and Europe and re-identifying thermal minima and optima to understand whether our estimates would change with different geographic representation in our dataset.
Species distribution modeling
XGBoost model fitting
Raster values for each occurrence and background centroid were extracted from the stack of covariates for a given species. The data was partitioned into a training and evaluation set, where 80% of the data was randomly selected for the training set and 20% was randomly selected for the validation set. We used stratified sampling so that there was an equal proportion of presence points in each set. We used gradient boosted classification trees to model the training data, as this class of algorithms fits flexible and nonlinear relationships (including thresholds) between environmental conditions and probability of species occurrence, identifies interactions between covariates due to the structure of the trees, and has been successfully used in other species distribution models [9,43,72]. Gradient boosted trees are a type of supervised learning algorithm that iteratively fits trees, each time fitting to the residual errors from the previous predictions, effectively ensembling weaker learners to accurately predict a target variable [73]. We fit models using eXtreme Gradient Boosting (XGBoost)[74] in R. Among the strengths of the XGBoost algorithm are its rapid computational speed when dealing with large-scale data, ability to handle class-imbalanced training sets (i.e., the number of observations are not equal across each level of the outcome variable), and ability to tolerate highly-collinear predictor variables. To quantify variation around model performance and output, we fit the model with 20 random train and validation splits.
Bayesian hyperparameter optimization
Gradient boosted classification trees rely on hyperparameters that control how each tree is fit and how individual trees’ estimates are combined. Bayesian optimization was conducted to tune key hyperparameters initialized with a range of user-inputted values, including learning rate (range: 0.0001 to 0.3), maximum tree depth (range: 2 to 50), minimum child weight (range: 1 to 50), subsample of observations used in each tree (range: 0.25 to 1), subsample of columns to use in each tree (range: 0.5 to 1), and minimum split loss (range: 0 to 100) [74,75]. For each round of the Bayesian optimization (up to a maximum of 24 rounds), we use 5-fold cross validation where we split the training data into 5 disjoint sets, and train the model on each combination of 4 folds while predicting out-of-sample for the fifth fold. We use the average out-of-sample log loss averaged over all 5 folds to identify the optimal number of boosting rounds (up to 10,000) for each iteration of the Bayesian optimization. After all iterations of the Bayesian optimization, we select the set of hyperparameters with the lowest out-of-sample log loss and train a final model using all 5 folds—representing all of the training data, which is 80% of the full data set.
Interpreting the model
Receiver operating characteristic (ROC) curves and their area under the curve (AUC) values, which graphically and numerically illustrate the discrimination ability of a binary classifier, were computed for model predictions on both the training and evaluation sets. We were particularly interested in two aspects of the resulting models: the inferred relationship between the probability of occurrence and temperature, and the relative importance of temperature in predicting a species distribution, compared to other environmental covariates. To understand the relationship between temperature and occurrence, we plotted univariate partial dependence plots for temperature mean and temperature standard deviation. Depending on the species, we use annual temperature, temperature during the photoperiod-dictated activity season, or temperature during the precipitation-dictated activity season. The partial dependence plots depict the marginal effect of temperature on the predicted value of the model and, at each temperature, is calculated as the mean predicted value (in this case, probability of species occurrence) across all combinations of other covariates observed in the dataset [76]. To assess the importance of temperature in relation to other environmental variables, measured in gain (i.e., the increase in accuracy that a given variable provides when a regression tree branch is split upon that variable) [75], we computed variable importance scores for the full set of predictors and compared them with temperature. We computed variable importance and PDPs for each bootstrapped iterations described above.
Mosquito abundance curves from the lab
Physiological life history trait thermal performance curves were collected from published literature [16,29,31]. Briefly, in previously published work asymmetric responses like mosquito development rate (MDR) were modeled using a Briere function, and both concave-down symmetric responses like eggs per female per day (EFD) and probability of egg-to-adult survival (pEA) as well as concave-up symmetric responses like mortality rate (μ) were modeled using quadratic functions. 5,000 posterior draws were selected from each distribution, and mosquito abundance M(T) was calculated using Eq (1), which was originally derived in [6], adapted from [10]. Median and 95% credible intervals were computed for each species’ M(T) and critical thermal limits.
(1) |
Comparison of thermal relationships from XGBoost versus lab-based models of mosquito abundance
We compared thermal minima and optima of the partial dependence plots from the XGBoost model to the mosquito abundance M(T) curves from the lab trait trajectories. For species distribution models, thermal minima were identified as the temperature at which the partial dependence plot began increasing, which we operationalized as the first time the empirical derivative was positive and stayed positive for the next step in temperature as well (Fig 1C). Thermal minima could also correspond to the threshold temperature at which species probability of occurrence is increasing most rapidly, so we supplementarily identify thermal minima as the temperature where the empirical derivative is largest (Fig N in S1 Text). Thermal optima were identified as the point where the empirical derivative was zero and the partial dependence plot was at its maximum (Fig 1C). We did not identify thermal maxima, as few species had partial dependence plots that clearly declined after the thermal optima and then reached a lower plateau that was within the range of observed temperatures. For the mosquito abundance M(T) curves derived from laboratory studies, to determine the thermal minima, we identified the temperature at which M(T) exceeded zero for each M(T) curve calculated from posterior draws and then calculated the median, 2.5th and 97.5th percentiles to produce a central estimate and confidence interval for thermal minima. For thermal optima, we identified the temperature where M(T) was highest for each curve, and similarly calculated median and 95% confidence intervals. We calculate the Pearson correlation between the median species distribution model-based thermal minima and the median lab-based thermal minima, and repeat for thermal optima to quantify how well the two methods compare. In particular, because temperature varies substantially in the field and M(T) curves are based on constant temperature measurements in the lab, we expect nonlinear averaging to affect the exact thermal limits observed in the field.
Results
The species distribution models predicted the occurrence of all focal species with high accuracy: discrimination across both training (in-sample) and evaluation (out-of-sample) datasets produced an area under the receiver operating curve (AUC) above 0.9, where 1 indicates perfect discrimination of presence and absence and 0.5 is no better than a coin toss (Fig 2; Table C in S1 Text; Fig L in S1 Text). Of the focal species, Cx. pipiens recorded the lowest out-of-sample evaluation AUC of 0.935. The fact that the AUCs in the evaluation set were comparable to the training set suggests that the models are learning general patterns, differentiating between training and evaluation sets, and not overfitting to the training data.
Measures of temperature, i.e., either annual or activity season-limited (as appropriate) temperature mean and temperature standard deviation, were consistently important predictors of mosquito occurrence, as measured by gain (Fig 2). For all species, mean temperature was among the top seven predictors. For Ae. aegypti, Ae. albopictus, and Cx. quinquefasciatus temperature mean was the top predictor and temperature standard deviation was second or third (Fig 2). An. gambiae, Cx. pipiens, and Cx. tarsalis had temperature standard deviation as a more important predictor than temperature mean (Fig 2).
Beyond temperature, different predictors were important for different mosquito species. Based on mean gain across the 20 bootstrapped iterations, forest cover was the most important predictor for Cx. tarsalis and second most important for An. gambiae, and relative humidity was most important for An. stephensi (Fig 2). Precipitation variables were among the top-5 predictor variables for all species. Precipitation of the wettest quarter was among the top-5 predictors for Ae. aegypti, Ae. albopictus and Cx. pipiens (Fig 2). Precipitation of the driest quarter was among the top-5 predictors for An. gambiae and Cx. quinquefasciatus, and both precipitation variables were among the top-5 predictors for An. stephensi and Cx. tarsalis (Fig 2). Surface water seasonality, however, was the least important predictor variable for all species distribution models (Fig 2).
While mechanistic relationships between temperature and mosquito abundance, based on laboratory-derived M(T), were unimodal and hump-shaped in form, the temperature relationships inferred from the statistical models (univariate XGBoost partial dependence plots; PDPs) were typically nonlinear with steep thresholds and plateaus (Fig 3). Both the mechanistic and statistical models showed steep increases as temperatures exceeded lower thermal limits (Fig 3). By contrast, while M(T) consistently responded unimodally to temperature, relationships from XGBoost PDPs mostly only showed lower thermal limits and thermal optima, but not upper limits (Fig 3). The difference in functional forms between M(T) and XGBoost PDPs is consistent with the fact that they each describe distinct processes. The mechanistic model, M(T), describes a relationship between temperature and mosquito abundance that continuously varies with temperature and the underlying laboratory-measured traits. By contrast, the PDPs from XGBoost capture the relationship between temperature and mosquito occurrence probability, which we expect to rise and fall sharply and reach a plateau at intermediate, suitable temperatures. The M(T) curves for all species showed clear thermal minima, optima, and maxima because the underlying laboratory experiments captured the full range of temperatures at which mosquito performance is optimized and at which it declines to zero.
Given that both the laboratory-based mechanistic approach (M(T)) and the statistical approach (XGBoost PDPs) predicted thermal minima and optima for each species, we asked whether the results of the two approaches were concordant. The estimates of thermal minima between M(T) and PDP were highly correlated across species (r = 0.869; Fig 4; Figs M-N in S1 Text), suggesting that M(T) captures key species-specific temperature constraints on occurrence in the field. Mosquito species of the Aedes genus exhibited thermal minima that were approximately 5°C cooler in the field than predicted in the laboratory (Fig 4; Fig M in S1 Text). Cx. tarsalis and Cx. pipiens consistently among the two lowest thermal minima, and An. gambiae and An. stephensi had the two highest thermal minimum when viewed across M(T) and PDP (Figs 4 and M-N in S1 Text). The alternative identification of thermal minima as the threshold temperature with the greatest increase in species occurrence resulted in a similarly high correlation between M(T) and PDP-based thermal minima (r = 0.871), but resulted in thermal minima approximately 2–5°C warmer in the field than predicted by the laboratory for Anopheles mosquitoes (Fig N in S1 Text). Thermal optima were similar, although more weakly correlated, between M(T) and PDP estimates (r = 0.687), supporting, at least in part, the conclusion that field based observations capture key components of lab-based non-linear response to temperature. However, the relationship appeared to deviate the most for temperate species Cx. pipiens and Cx. taralis, for which thermal optima were 9.4°C and 11.5°C lower, respectively, in the field versus laboratory (Fig 4; Table D in S1 Text; Figs M-N in S1 Text). We could not compare thermal maxima because our XGBoost models did not identify thermal maxima for any focal species.
When fitting models for Ae. aegypti and Ae. albopictus without North America (Figs O-P in S1 Text) and Europe (Figs Q-R in S1 Text) occurrence and background points, we find little difference in detected thermal minima and optima or their correlations to the mechanistic lab-based estimates (Fig S in S1 Text), suggesting there are likely minimal differences in critical thermal values between geographic regions.
Discussion
For all seven major vectors of human disease we investigated, species distribution models captured occurrence probability with high discrimination and accuracy and temperature was an important predictor (Fig 2; Fig L in S1 Text). In particular, temperature mean and standard deviation were highly important predictors across all focal species (Fig 2). This finding echoes the importance of temperature as a predictor in past models, although many such models define temperature differently from our study (e.g., temperature suitability, temperature of the hottest quarter, temperature of the coldest month) [43,48,77,78]. Four species (Ae. aegypti, Ae. albopictus, Cx. pipiens, and Cx. quinquefasciatus) had either temperature mean or standard deviation as their most important predictor (Fig 2). Nonlinear relationships with temperature from ‘bottom-up’ models of mosquito abundance from lab life-history traits were partially recapitulated in the ‘top-down’ statistical models of global-scale occurrence data, particularly at the lower end of the temperature range, consistent with previous work on other mosquito species [48]. In general, thermal minima and optima were highly comparable across species between the trait-based and statistical approaches (Fig 4). We find that the thermal minima are more strongly correlated than the thermal optima, consistent with the fact that the species distribution models aim to identify the conditions for species occurrence and the partial dependence plots show steep increases from low temperature not conducive to species occurrences then plateau. By contrast, PDPs show little change in occurrence probability with further changes in temperature. In contrast to the thermal minima and optima, thermal maxima were not comparable: XGBoost did not consistently detect thermal maxima for occurrence in the field, despite laboratory evidence that mosquito life history traits decline precipitously at high temperatures. This suggests that thermal constraints on species may not be identifiable using observational approaches, such as species distribution models, alone.
Thermal physiology theory and experiments have established that organismal performance is constrained to an operative range of temperatures at which key functions can occur and populations can stably persist [79–83]. Yet, realized species distributions may not clearly reflect these thermal constraints if other factors are also important for constraining distributions, including rainfall, seasonality, biotic interactions, habitat, movement rates, and the range of temperatures that occur within accessible regions [45,84–90]. For example, aridity may constrain mosquito distributions beyond direct high-temperature constraints. Likewise, average temperatures across activity seasons may not reach levels high enough to exclude mosquito persistence [7,79,91–93]. Alternatively, more extreme temperature variation at low or high means can limit organismal performance, and even moderate variation can reduce performance near optimal temperatures (i.e., where M(T) curves are concave-down; Fig 3). The impact of temperature variation could explain the observation that for species with cooler thermal minima and optima, estimates at constant temperatures in the lab were up to 10°C warmer than estimates in the field (Fig 4).
This highlights the importance of climate change projections that consider both field-based estimates of thermal and other constraints on species distributions and more mechanistic estimates, particularly for thermal optima and maxima, which may be difficult to observe in the field under current and historical conditions [94]. Simply projecting species distribution models like those derived here under future climate change scenarios is likely to overlook the potential for warming temperatures to exceed thermal optima and limits for species persistence. Moreover, it is critical for studies of vector-borne disease transmission under future climate change to account for the gap between potential and realized future vector thermal niches, as vectors may not immediately track geographic shifts in climate suitability. Approaches that combine mechanistic thermal performance information (e.g., from laboratory studies and life history models) with statistical inference based on current distributions are most likely to accurately capture current and future environmental constraints on species persistence, including for noxious species like disease vectors.
This study incorporates several methodological innovations that advance upon previously published species distribution modeling methods for mosquitoes, particularly focused on estimating temperature bounds, with applications for climate change models. First, we computed covariates aligned to the time of collection of our mosquito occurrence data, in contrast to numerous studies that use climatologies estimated from the years preceding the occurrence point sampling time frame [9,43,47,49,95,96]. Second, given our explicit goal of estimating thermal limits, we restricted our measures of average temperature to the relevant activity season of each focal species [97], which ensures that we are using the temperature range of the mosquito in the field when that species is active and non-dormant (and therefore comparable to laboratory experimental measurements). Third, we selected background points from the ecoregions in which a species occurs plus a 200 km buffer, as well as adjacent ecoregions, to ensure that we are estimating temperature limitations based on regions that are plausible based on habitat suitability. This method, grounded in ecological theory [34,37,98] and similar to background limitation methods for non-mosquito species in the literature [41,99–102], functionally decouples habitat and temperature. This is largely a new advancement for mosquito species distribution modeling. Finally, while most mosquito species distribution modeling studies focus on identifying predictors of one or two mosquito species, here we explicitly aim to infer thermal limits and optima and to compare these to mechanistic estimates [77] among multiple medically-relevant mosquito vector species [43,78,103–105].
However, our study also has several limitations. First, the lack of thermal breadth in the background sampling may limit our ability to accurately infer thermal maxima. Mosquito species that occur in hotter climates, such as Ae. aegypti, An. stephensi, and An. gambiae, lack a sufficient amount of ecoregion-matched background points that have comparably high, or higher, temperatures than the occurrence points, making it difficult to estimate the temperature range that would be hot enough to prevent occurrence (i.e., identify thermal maxima) [106] and some species may not currently be constrained by their physiological upper thermal limits. Second, because our goal was to create accurate but comparable species distribution models for seven mosquito species across their global extent, we primarily relied on publicly available GBIF occurrence points (see Methods for data filtering criteria), which may not capture the full extent of each species that other sources such as local vector surveillance data might capture. Third, since we were focused on comparing laboratory-derived thermal performance curves to field occurrence, we used temperatures measured during the activity season as predictors in XGBoost models for four out of seven species. Despite the importance of activity season temperature, other climatic variables such as winter low temperatures or dry-season aridity may be equally or more important limitations on species distributions. Fourth, the species distribution modeling framework assumes that species occupy their full climatic niche, but this may not be the case for species that are actively expanding their ranges. Finally, our analysis is based on a core assumption that using the 2000–2019 average of covariates can accurately characterize the average ecological or habitat preferences of a mosquito reported in a specific date and location.
Conclusions
As climate change shifts the geographic and seasonal distribution of environmental conditions, it is critical to understand how temperature limits species ranges. Temperature in particular affects vector-borne diseases because of its effects on vector biting rate, parasite incubation rate, survival, and other life history traits. However, the thermal constraints on vector occurrence are less well understood, and particularly how they vary among important vector species. Here, we showed that temperature mean and variation during the activity season provide important constraints on the ranges of seven important mosquito vectors, and that thermal minima and, to a lesser degree, thermal optima observed in the field are closely correlated to those measured in laboratory experimental studies. This finding suggests that species distribution models can, to some extent, contribute to understanding the thermal biology of organisms that cannot be studied in a laboratory setting. Importantly, statistical species distribution models derived from field observations did not clearly identify upper thermal constraints even though these have been directly demonstrated experimentally in the laboratory. Climate change is likely to drive many regions past the currently observed range of temperatures. This highlights the critical need for mechanistic, trait-based studies that capture temperature ranges at which mosquito abundance and occurrence begin to decline [6,106,107] and, at the very least, emphasizes that extrapolations from statistical models based on current species distributions should be validated using physiological models [35,107,108].
Supporting information
Acknowledgments
The authors thank Marta Shocket and Oswaldo Villena for providing mosquito life history trait data, and Marianne Sinka and Joshua Longbottom for supplying additional species occurrence points for Anopheles stephensi.
Data Availability
Data and code can be found at https://github.com/tathni/mosquito-sdm.
Funding Statement
TSA is supported by the National Institute of General Medical Sciences (grant no. T32GM144273). MLC was supported by the through the Illich-Sadowsky Interdisciplinary Graduate Fellowship program at Stanford University and an Environmental Fellowship at the Harvard University Center for the Environment. EAM and CKG were supported by the National Science Foundation and the Fogarty International Center (grant no. DEB-2011147). EAM was additionally supported by the National Institute of Allergy and Infectious Diseases (grant nos R01AI168097 and R01AI102918), the National Institutes of Health (grant no. R35GM133439), and by seed grants from the Stanford Woods Institute for the Environment, King Center on Global Development, Center for Innovation in Global Health and Terman Award. CKG was additionally supported by a Stanford Institute for Human-centered Artificial Intelligence Postdoctoral Fellowship. The funders did not play a role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Funder websites in include: https://www.nigms.nih.gov/ https://vpge.stanford.edu/fellowships-funding/sigf/sigf-named-fellowships https://environment.harvard.edu/environmental-fellows-program https://www.nih.gov/about-nih/what-we-do/nih-almanac/fogarty-international-center-fic https://www.niaid.nih.gov/ https://www.nih.gov/ https://kingcenter.stanford.edu/ https://globalhealth.stanford.edu/ https://biology.stanford.edu/news/erin-mordecai-receives-terman-award https://hai.stanford.edu/research/fellowship-programs https://woods.stanford.edu/.
References
- 1.Bhatt S, Gething PW, Brady OJ, Messina JP, Farlow AW, Moyes CL, et al. The global distribution and burden of dengue. Nature. 2013;496: 504–507. doi: 10.1038/nature12060 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Gaythorpe KAM, Hamlet A, Jean K, Garkauskas Ramos D, Cibrelus L, Garske T, et al. The global burden of yellow fever. Elife. 2021;10. doi: 10.7554/elife.64670 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Murray CJL, Rosenfeld LC, Lim SS, Andrews KG, Foreman KJ, Haring D, et al. Global malaria mortality between 1980 and 2010: a systematic analysis. Lancet. 2012;379: 413–431. doi: 10.1016/S0140-6736(12)60034-8 [DOI] [PubMed] [Google Scholar]
- 4.Puntasecca CJ, King CH, LaBeaud AD. Measuring the global burden of chikungunya and Zika viruses: A systematic review. PLoS Negl Trop Dis. 2021;15: e0009055. doi: 10.1371/journal.pntd.0009055 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Stanaway JD, Shepard DS, Undurraga EA, Halasa YA, Coffeng LE, Brady OJ, et al. The global burden of dengue: an analysis from the Global Burden of Disease Study 2013. Lancet Infect Dis. 2016;16: 712–723. doi: 10.1016/S1473-3099(16)00026-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Mordecai EA, Paaijmans KP, Johnson LR, Balzer C, Ben-Horin T, de Moor E, et al. Optimal temperature for malaria transmission is dramatically lower than previously predicted. Ecol Lett. 2013;16: 22–30. doi: 10.1111/ele.12015 [DOI] [PubMed] [Google Scholar]
- 7.Paaijmans KP, Heinig RL, Seliga RA, Blanford JI, Blanford S, Murdock CC, et al. Temperature variation makes ectotherms more sensitive to climate change. Glob Chang Biol. 2013;19: 2373–2380. doi: 10.1111/gcb.12240 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Kraemer MUG, Reiner RC Jr, Brady OJ, Messina JP, Gilbert M, Pigott DM, et al. Past and future spread of the arbovirus vectors Aedes aegypti and Aedes albopictus. Nat Microbiol. 2019;4: 854–863. doi: 10.1038/s41564-019-0376-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Messina JP, Brady OJ, Golding N, Kraemer MUG, Wint GRW, Ray SE, et al. The current and future global distribution and population at risk of dengue. Nat Microbiol. 2019;4: 1508–1515. doi: 10.1038/s41564-019-0476-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Parham PE, Michael E. Modeling the effects of weather and climate change on malaria transmission. Environ Health Perspect. 2010;118: 620–626. doi: 10.1289/ehp.0901256 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Rochlin I, Ninivaggi DV, Hutchinson ML, Farajollahi A. Climate change and range expansion of the Asian tiger mosquito (Aedes albopictus) in Northeastern USA: implications for public health practitioners. PLoS One. 2013;8: e60874. doi: 10.1371/journal.pone.0060874 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Focks DA, Haile DG, Daniels E, Mount GA. Dynamic life table model for Aedes aegypti (Diptera: Culicidae): analysis of the literature and model development. J Med Entomol. 1993;30: 1003–1017. doi: 10.1093/jmedent/30.6.1003 [DOI] [PubMed] [Google Scholar]
- 13.Rueda LM, Patel KJ, Axtell RC, Stinner RE. Temperature-dependent development and survival rates of Culex quinquefasciatus and Aedes aegypti (Diptera: Culicidae). J Med Entomol. 1990;27: 892–898. doi: 10.1093/jmedent/27.5.892 [DOI] [PubMed] [Google Scholar]
- 14.Shapiro LLM, Whitehead SA, Thomas MB. Quantifying the effects of temperature on mosquito and parasite traits that determine the transmission potential of human malaria. PLoS Biol. 2017;15: e2003489. doi: 10.1371/journal.pbio.2003489 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Tun-Lin W, Burkot TR, Kay BH. Effects of temperature and larval diet on development rates and survival of the dengue vector Aedes aegypti in north Queensland, Australia. Med Vet Entomol. 2000;14: 31–37. doi: 10.1046/j.1365-2915.2000.00207.x [DOI] [PubMed] [Google Scholar]
- 16.Mordecai EA, Caldwell JM, Grossman MK, Lippi CA, Johnson LR, Neira M, et al. Thermal biology of mosquito-borne disease. Ecol Lett. 2019;22: 1690–1708. doi: 10.1111/ele.13335 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Amarasekare P, Savage V. A framework for elucidating the temperature dependence of fitness. Am Nat. 2012;179: 178–191. doi: 10.1086/663677 [DOI] [PubMed] [Google Scholar]
- 18.Huey RB, Berrigan D. Temperature, demography, and ectotherm fitness. Am Nat. 2001;158: 204–210. doi: 10.1086/321314 [DOI] [PubMed] [Google Scholar]
- 19.Brown JH, Gillooly JF, Allen AP, Savage VM, West GB. Toward a metabolic theory of ecology. Ecology. 2004;85: 1771–1789. doi: 10.1890/03-9000 [DOI] [Google Scholar]
- 20.Craig MH, Snow RW, le Sueur D. A climate-based distribution model of malaria transmission in sub-Saharan Africa. Parasitol Today. 1999;15: 105–111. doi: 10.1016/s0169-4758(99)01396-4 [DOI] [PubMed] [Google Scholar]
- 21.Johansson MA, Powers AM, Pesik N, Cohen NJ, Staples JE. Nowcasting the spread of Chikungunya virus in the Americas. PLoS One. 2014;9: e104915. doi: 10.1371/journal.pone.0104915 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Kushmaro A, Friedlander TA, Levins R. Temperature effects on the basic reproductive number (R0) of west Nile virus, based on ecological parameters: Endemic vs. New emergence regions. J Trop Dis. 2015;s1. doi: 10.4172/2329-891x.1000s1-001 [DOI] [Google Scholar]
- 23.Lafferty KD. The ecology of climate change and infectious diseases. Ecology. 2009;90: 888–900. doi: 10.1890/08-0079.1 [DOI] [PubMed] [Google Scholar]
- 24.Liu-Helmersson J, Stenlund H, Wilder-Smith A, Rocklöv J. Vectorial capacity of Aedes aegypti: effects of temperature and implications for global dengue epidemic potential. PLoS One. 2014;9: e89783. doi: 10.1371/journal.pone.0089783 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Mordecai EA, Cohen JM, Evans MV, Gudapati P, Johnson LR, Lippi CA, et al. Detecting the impact of temperature on transmission of Zika, dengue, and chikungunya using mechanistic models. PLoS Negl Trop Dis. 2017;11: e0005568. doi: 10.1371/journal.pntd.0005568 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Morin CW, Monaghan AJ, Hayden MH, Barrera R, Ernst K. Meteorologically driven simulations of dengue epidemics in San Juan, PR. PLoS Negl Trop Dis. 2015;9: e0004002. doi: 10.1371/journal.pntd.0004002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Paull SH, Horton DE, Ashfaq M, Rastogi D, Kramer LD, Diffenbaugh NS, et al. Drought and immunity determine the intensity of West Nile virus epidemics and climate change impacts. Proc Biol Sci. 2017;284: 20162078. doi: 10.1098/rspb.2016.2078 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Shocket MS, Ryan SJ, Mordecai EA. Temperature explains broad patterns of Ross River virus transmission. Elife. 2018;7. doi: 10.7554/eLife.37762 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Shocket MS, Verwillow AB, Numazu MG, Slamani H, Cohen JM, El Moustaid F, et al. Transmission of West Nile and five other temperate mosquito-borne viruses peaks at temperatures between 23°C and 26°C. Elife. 2020;9. doi: 10.7554/elife.58511 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Tesla B, Demakovsky LR, Mordecai EA, Ryan SJ, Bonds MH, Ngonghala CN, et al. Temperature drives Zika virus transmission: evidence from empirical and mathematical models. Proc Biol Sci. 2018;285: 20180795. doi: 10.1098/rspb.2018.0795 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Villena OC, Ryan SJ, Murdock CC, Johnson LR. Temperature impacts the environmental suitability for malaria transmission by Anopheles gambiae and Anopheles stephensi. Ecology. 2022;103: e3685. doi: 10.1002/ecy.3685 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Vogels CBF, Hartemink N, Koenraadt CJM. Modelling West Nile virus transmission risk in Europe: effect of temperature and mosquito biotypes on the basic reproduction number. Sci Rep. 2017;7: 5022. doi: 10.1038/s41598-017-05185-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Wesolowski A, Qureshi T, Boni MF, Sundsøy PR, Johansson MA, Rasheed SB, et al. Impact of human mobility on the emergence of dengue epidemics in Pakistan. Proc Natl Acad Sci U S A. 2015;112: 11887–11892. doi: 10.1073/pnas.1504964112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Barve N, Barve V, Jiménez-Valverde A, Lira-Noriega A, Maher SP, Peterson AT, et al. The crucial role of the accessible area in ecological niche modeling and species distribution modeling. Ecol Modell. 2011;222: 1810–1819. doi: 10.1016/j.ecolmodel.2011.02.011 [DOI] [Google Scholar]
- 35.Elith J, Leathwick JR. Species distribution models: Ecological explanation and prediction across space and time. Annu Rev Ecol Evol Syst. 2009;40: 677–697. doi: 10.1146/annurev.ecolsys.110308.120159 [DOI] [Google Scholar]
- 36.Peterson AT, Soberón J. Species distribution modeling and ecological niche modeling: Getting the concepts right. Nat Conserv. 2012;10: 102–107. doi: 10.4322/natcon.2012.019 [DOI] [Google Scholar]
- 37.Soberon J, Peterson AT. Interpretation of models of fundamental ecological niches and species’ distributional areas. Biodivers Inf. 2005;2. doi: 10.17161/bi.v2i0.4 [DOI] [Google Scholar]
- 38.Kanagaraj R, Araujo MB, Barman R, Davidar P, De R, Digal DK, et al. Predicting range shifts of Asian elephants under global change. Divers Distrib. 2019;25: 822–838. doi: 10.1111/ddi.12898 [DOI] [Google Scholar]
- 39.Peers MJL, Thornton DH, Murray DL. Reconsidering the specialist-generalist paradigm in niche breadth dynamics: resource gradient selection by Canada lynx and bobcat. PLoS One. 2012;7: e51488. doi: 10.1371/journal.pone.0051488 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Leahy L, Scheffers BR, Williams SE, Andersen AN. Diversity and distribution of the dominant ant genus Anonychomyrma (Hymenoptera: Formicidae) in the Australian Wet Tropics. Diversity (Basel). 2020;12: 474. doi: 10.3390/d12120474 [DOI] [Google Scholar]
- 41.Gogol-Prokurat M. Predicting habitat suitability for rare plants at local spatial scales using a species distribution model. Ecol Appl. 2011;21: 33–47. doi: 10.1890/09-1190.1 [DOI] [PubMed] [Google Scholar]
- 42.Edwards JL. Research and societal benefits of the global biodiversity information facility. Bioscience. 2004;54: 486. doi: 10.1641/0006-3568(2004)054[0486:rasbot]2.0.co;2 [DOI] [Google Scholar]
- 43.Kraemer MUG, Sinka ME, Duda KA, Mylne AQN, Shearer FM, Barker CM, et al. The global distribution of the arbovirus vectors Aedes aegypti and Ae. albopictus. Elife. 2015;4: e08347. doi: 10.7554/eLife.08347 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Messina JP, Pigott DM, Golding N, Duda KA, Brownstein JS, Weiss DJ, et al. The global distribution of Crimean-Congo hemorrhagic fever. Trans R Soc Trop Med Hyg. 2015;109: 503–513. doi: 10.1093/trstmh/trv050 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Wisz MS, Pottier J, Kissling WD, Pellissier L, Lenoir J, Damgaard CF, et al. The role of biotic interactions in shaping distributions and realised assemblages of species: implications for species distribution modelling. Biol Rev Camb Philos Soc. 2013;88: 15–30. doi: 10.1111/j.1469-185X.2012.00235.x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Akpan GE, Adepoju KA, Oladosu OR, Adelabu SA. Dominant malaria vector species in Nigeria: Modelling potential distribution of Anopheles gambiae sensu lato and its siblings with MaxEnt. PLoS One. 2018;13: e0204233. doi: 10.1371/journal.pone.0204233 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Conley AK, Fuller DO, Haddad N, Hassan AN, Gad AM, Beier JC. Modeling the distribution of the West Nile and Rift Valley Fever vector Culex pipiens in arid and semi-arid regions of the Middle East and North Africa. Parasit Vectors. 2014;7: 289. doi: 10.1186/1756-3305-7-289 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Fatima SH, Atif S, Rasheed SB, Zaidi F, Hussain E. Species Distribution Modelling of Aedes aegypti in two dengue-endemic regions of Pakistan. Trop Med Int Health. 2016;21: 427–436. doi: 10.1111/tmi.12664 [DOI] [PubMed] [Google Scholar]
- 49.Mweya CN, Kimera SI, Kija JB, Mboera LEG. Predicting distribution ofAedes aegyptiandCulex pipienscomplex, potential vectors of Rift Valley fever virus in relation to disease epidemics in East Africa. Infect Ecol Epidemiol. 2013;3: 21748. doi: 10.3402/iee.v3i0.21748 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Zhang C, Wei D, Shi G, Huang X, Cheng P, Liu G, et al. Understanding the regulation of overwintering diapause molecular mechanisms in Culex pipiens pallens through comparative proteomics. Sci Rep. 2019;9: 6485. doi: 10.1038/s41598-019-42961-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Lehmann T, Dao A, Yaro AS, Adamou A, Kassogue Y, Diallo M, et al. Aestivation of the African malaria mosquito, Anopheles gambiae in the Sahel. Am J Trop Med Hyg. 2010;83: 601–606. doi: 10.4269/ajtmh.2010.09-0779 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Poelchau MF, Reynolds JA, Elsik CG, Denlinger DL, Armbruster PA. RNA-Seq reveals early distinctions and late convergence of gene expression between diapause and quiescence in the Asian tiger mosquito, Aedes albopictus. J Exp Biol. 2013;216: 4082–4090. doi: 10.1242/jeb.089508 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Rinehart JP, Robich RM, Denlinger DL. Enhanced cold and desiccation tolerance in diapausing adults of Culex pipiens, and a role for Hsp70 in response to cold shock but not as a component of the diapause program. J Med Entomol. 2006;43: 713–722. doi: 10.1603/0022-2585(2006)43[713:ECADTI]2.0.CO;2 [DOI] [PubMed] [Google Scholar]
- 54.Robich RM, Denlinger DL. Diapause in the mosquito Culex pipiens evokes a metabolic switch from blood feeding to sugar gluttony. Proc Natl Acad Sci U S A. 2005;102: 15912–15917. doi: 10.1073/pnas.0507958102 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Yaro AS, Traoré AI, Huestis DL, Adamou A, Timbiné S, Kassogué Y, et al. Dry season reproductive depression of Anopheles gambiae in the Sahel. J Insect Physiol. 2012;58: 1050–1059. doi: 10.1016/j.jinsphys.2012.04.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Kazadi S-N, Kaoru F. Interannual and long-term climate variability over the Zaire River Basin during the last 30 years. J Geophys Res. 1996;101: 21351–21360. doi: 10.1029/96jd01869 [DOI] [Google Scholar]
- 57.Mitchell CJ. Geographic spread of Aedes albopictus and potential for involvement in arbovirus cycles in the Mediterranean basin. J Vector Ecol. 1995;20: 44–58. [Google Scholar]
- 58.Chang L-H, Hsu E-L, Teng H-J, Ho C-M. Differential survival of Aedes aegypti and Aedes albopictus (Diptera: Culicidae) larvae exposed to low temperatures in Taiwan. J Med Entomol. 2007;44: 205–210. doi: 10.1603/0022-2585(2007)44[205:dsoaaa]2.0.co;2 [DOI] [PubMed] [Google Scholar]
- 59.Lacour G, Chanaud L, L’Ambert G, Hance T. Seasonal Synchronization of Diapause Phases in Aedes albopictus (Diptera: Culicidae). PLoS One. 2015;10: e0145311. doi: 10.1371/journal.pone.0145311 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Sinka ME, Bangs MJ, Manguin S, Chareonviriyaphap T, Patil AP, Temperley WH, et al. The dominant Anopheles vectors of human malaria in the Asia-Pacific region: occurrence data, distribution maps and bionomic précis. Parasit Vectors. 2011;4: 89. doi: 10.1186/1756-3305-4-89 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Meuti ME, Short CA, Denlinger DL. Mom matters: Diapause characteristics of Culex pipiens-Culex quinquefasciatus (Diptera: Culicidae) hybrid mosquitoes. J Med Entomol. 2015;52: 131–137. doi: 10.1093/jme/tju016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Almirón WR, Brewer ME. Winter biology of Culex pipiens quinquefasciatus say, (Diptera: Culicidae) from Córdoba, Argentina. Mem Inst Oswaldo Cruz. 1996;91: 649–654. doi: 10.1590/s0074-02761996000500019 [DOI] [Google Scholar]
- 63.Buth JL, Brust RA, Ellis RA. Development time, oviposition activity and onset of diapause in Culex tarsalis, Culex restuans and Culiseta inornata in southern Manitoba. J Am Mosq Control Assoc. 1990;6: 55–63. [PubMed] [Google Scholar]
- 64.Brown JJ, Pascual M, Wimberly MC, Johnson LR, Murdock CC. Humidity-The overlooked variable thermal biology mosquito-borne disease. Ecology Letters. 2023;26: 1029–1049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Sinka ME, Pironon S, Massey NC, Longbottom J, Hemingway J, Moyes CL, et al. A new malaria vector in Africa: Predicting the expansion range of Anopheles stephensi and identifying the urban populations at risk. Proc Natl Acad Sci U S A. 2020;117: 24900–24908. doi: 10.1073/pnas.2003976117 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Wiebe A, Longbottom J, Gleave K, Shearer FM, Sinka ME, Massey NC, et al. Geographical distributions of African malaria vector sibling species and evidence for insecticide resistance. Malar J. 2017;16. doi: 10.1186/s12936-017-1734-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Barbet-Massin M, Rome Q, Villemant C, Courchamp F. Can species distribution models really predict the expansion of invasive species? PLoS One. 2018;13: e0193085. doi: 10.1371/journal.pone.0193085 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Kramer-Schadt S, Niedballa J, Pilgrim JD, Schröder B, Lindenborn J, Reinfelder V, et al. The importance of correcting for sampling bias in MaxEnt species distribution models. Divers Distrib. 2013;19: 1366–1379. doi: 10.1111/ddi.12096 [DOI] [Google Scholar]
- 69.Liu C, White M, Newell G. Selecting thresholds for the prediction of species occurrence with presence-only data. J Biogeogr. 2013;40: 778–789. doi: 10.1111/jbi.12058 [DOI] [Google Scholar]
- 70.Chevillon C, Pasteur N, Marquine M, Heyse D, Raymond M. Population structure and dynamics of selected genes in the mosquito Culex pipiens. Evolution. 1995;49: 997. doi: 10.1111/j.1558-5646.1995.tb02334.x [DOI] [PubMed] [Google Scholar]
- 71.Verdonschot PFM, Besse-Lototskaya AA. Flight distance of mosquitoes (Culicidae): A metadata analysis to support the management of barrier zones around rewetted and newly constructed wetlands. Limnologica. 2014;45: 69–79. doi: 10.1016/j.limno.2013.11.002 [DOI] [Google Scholar]
- 72.Li SL, Acosta AL, Hill SC, Brady OJ, de Almeida MAB, Cardoso J da C, et al. Mapping environmental suitability of Haemagogus and Sabethes spp. mosquitoes to understand sylvatic transmission risk of yellow fever virus in Brazil. PLoS Negl Trop Dis. 2022;16: e0010019. doi: 10.1371/journal.pntd.0010019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Natekin A, Knoll A. Gradient boosting machines, a tutorial. Front Neurorobot. 2013;7. doi: 10.3389/fnbot.2013.00021 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Chen T, He T, Benesty M, Khotilovich V, Tang Y, Cho H, et al. Xgboost: extreme gradient boosting. R Package Version. 2015;1: 1–4. [Google Scholar]
- 75.XGBoost Parameters—xgboost 2.1.1 documentation. [cited 5 Sep 2024]. Available: https://xgboost.readthedocs.io/en/stable/parameter.html
- 76.Goldstein A, Kapelner A, Bleich J, Pitkin E. Peeking inside the black box: Visualizing statistical learning with plots of individual conditional expectation. J Comput Graph Stat. 2015;24: 44–65. doi: 10.1080/10618600.2014.907095 [DOI] [Google Scholar]
- 77.Brady OJ, Johansson MA, Guerra CA, Bhatt S, Golding N, Pigott DM, et al. Modelling adult Aedes aegypti and Aedes albopictus survival at different temperatures in laboratory and field settings. Parasit Vectors. 2013;6. doi: 10.1186/1756-3305-6-351 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Cianci D, Hartemink N, Ibáñez-Justicia A. Modelling the potential spatial distribution of mosquito species using three different techniques. Int J Health Geogr. 2015;14: 10. doi: 10.1186/s12942-015-0001-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Bernhardt JR, Sunday JM, Thompson PL, O’Connor MI. Nonlinear averaging of thermal experience predicts population growth rates in a thermally variable environment. Proc Biol Sci. 2018;285: 20181076. doi: 10.1098/rspb.2018.1076 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Angilletta MJ, Angilletta MJ. Thermal Adaptation: Theoretical Empirical Synthesis. Oxford: OUP; 2009. [Google Scholar]
- 81.Dell AI, Pawar S, Savage VM. Systematic variation in the temperature dependence of physiological and ecological traits. Proc Natl Acad Sci U S A. 2011;108: 10591–10596. doi: 10.1073/pnas.1015178108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Deutsch CA, Tewksbury JJ, Huey RB, Sheldon KS, Ghalambor CK, Haak DC, et al. Impacts of climate warming on terrestrial ectotherms across latitude. Proc Natl Acad Sci U S A. 2008;105: 6668–6672. doi: 10.1073/pnas.0709472105 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Somero GN. Thermal physiology and vertical zonation of intertidal animals: optima, limits, and costs of living. Integr Comp Biol. 2002;42: 780–789. doi: 10.1093/icb/42.4.780 [DOI] [PubMed] [Google Scholar]
- 84.Crozier L, Dwyer G. Combining population-dynamic and ecophysiological models to predict climate-induced insect range shifts. Am Nat. 2006;167: 853–866. doi: 10.1086/504848 [DOI] [PubMed] [Google Scholar]
- 85.Buckley LB, Urban MC, Angilletta MJ, Crozier LG, Rissler LJ, Sears MW. Can mechanism inform species’ distribution models? Ecol Lett. 2010;13: 1041–1054. doi: 10.1111/j.1461-0248.2010.01479.x [DOI] [PubMed] [Google Scholar]
- 86.Davis AJ, Jenkinson LS, Lawton JH, Shorrocks B, Wood S. Making mistakes when predicting shifts in species range in response to global warming. Nature. 1998;391: 783–786. doi: 10.1038/35842 [DOI] [PubMed] [Google Scholar]
- 87.Guisan A, Thuiller W. Predicting species distribution: offering more than simple habitat models. Ecol Lett. 2005;8: 993–1009. doi: 10.1111/j.1461-0248.2005.00792.x [DOI] [PubMed] [Google Scholar]
- 88.Heikkinen RK, Luoto M, Virkkala R, Pearson RG, Körber J-H. Biotic interactions improve prediction of boreal bird distributions at macro-scales. Glob Ecol Biogeogr. 2007;16: 754–763. doi: 10.1111/j.1466-8238.2007.00345.x [DOI] [Google Scholar]
- 89.Lounibos LP, Juliano SA. Where vectors collide: The importance of mechanisms shaping the realized niche for modeling ranges of invasive Aedes mosquitoes. Biol Invasions. 2018;20: 1913–1929. doi: 10.1007/s10530-018-1674-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Pulliam HR. On the relationship between niche and distribution. Ecol Lett. 2000;3: 349–361. doi: 10.1046/j.1461-0248.2000.00143.x [DOI] [Google Scholar]
- 91.Lambrechts L, Paaijmans KP, Fansiri T, Carrington LB, Kramer LD, Thomas MB, et al. Impact of daily temperature fluctuations on dengue virus transmission by Aedes aegypti. Proc Natl Acad Sci U S A. 2011;108: 7460–7465. doi: 10.1073/pnas.1101377108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Paaijmans KP, Read AF, Thomas MB. Understanding the link between malaria risk and climate. Proc Natl Acad Sci U S A. 2009;106: 13844–13849. doi: 10.1073/pnas.0903423106 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Paaijmans KP, Blanford S, Bell AS, Blanford JI, Read AF, Thomas MB. Influence of climate on malaria transmission depends on daily temperature variation. Proc Natl Acad Sci U S A. 2010;107: 15135–15139. doi: 10.1073/pnas.1006422107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Gamliel I, Buba Y, Guy-Haim T, Garval T, Willette D, Rilov G, et al. Incorporating physiology into species distribution models moderates the projected impact of warming on selected Mediterranean marine species. Ecography (Cop). 2020;43: 1090–1106. doi: 10.1111/ecog.04423 [DOI] [Google Scholar]
- 95.Salah Mohammad Kheir Azzam Mohammad Alahmed MN, Sallam Mohamed Fahim. Ecological Distribution Modeling Two Malaria Mosquito Vectors Using Geographical Information System Al-Baha Province. Pak J Zool. 2015;47: 1797–1806. [Google Scholar]
- 96.Richman R, Diallo D, Diallo M, Sall AA, Faye O, Diagne CT, et al. Ecological niche modeling of Aedes mosquito vectors of chikungunya virus in southeastern Senegal. Parasit Vectors. 2018;11: 255. doi: 10.1186/s13071-018-2832-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Smeraldo S, Di Febbraro M, Bosso L, Flaquer C, Guixé D, Lisón F, et al. Ignoring seasonal changes in the ecological niche of non-migratory species may lead to biases in potential distribution models: lessons from bats. Biodivers Conserv. 2018;27: 2425–2441. doi: 10.1007/s10531-018-1545-7 [DOI] [Google Scholar]
- 98.Soberón JM. Niche and area of distribution modeling: a population ecology perspective. Ecography (Cop). 2010;33: 159–167. doi: 10.1111/j.1600-0587.2009.06074.x [DOI] [Google Scholar]
- 99.Nuñez-Penichet C, Cobos ME, Barro A, Soberón J. Potential migratory routes of Urania boisduvalii (Lepidoptera: Uraniidae) among host plant populations. Divers Distrib. 2019;25: 478–488. doi: 10.1111/ddi.12881 [DOI] [Google Scholar]
- 100.Khoury CK, Carver D, Barchenger DW, Barboza GE, Zonneveld M, Jarret R, et al. Modelled distributions conservation status wild relatives chile peppers (Capsicum L.). Diversity & Distributions. 2020;26: 209–225. [Google Scholar]
- 101.Lira-Noriega A, Soberón J, Equihua J. Potential invasion of exotic ambrosia beetles Xyleborus glabratus and Euwallacea sp. in Mexico: A major threat for native and cultivated forest ecosystems. Sci Rep. 2018;8. doi: 10.1038/s41598-018-28517-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.Mertens A, Swennen R, Rønsted N, Vandelook F, Panis B, Sachter-Smith G, et al. Conservation status assessment of banana crop wild relatives using species distribution modelling. Divers Distrib. 2021;27: 729–746. doi: 10.1111/ddi.13233 [DOI] [Google Scholar]
- 103.Cunze S, Koch LK, Kochmann J, Klimpel S. Aedes albopictus and Aedes japonicus—two invasive mosquito species with different temperature niches in Europe. Parasit Vectors. 2016;9: 573. doi: 10.1186/s13071-016-1853-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Kulkarni MA, Desrochers RE, Kerr JT. High resolution niche models of malaria vectors in northern Tanzania: A new capacity to predict malaria risk? PLoS One. 2010;5: e9396. doi: 10.1371/journal.pone.0009396 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Uusitalo R, Siljander M, Culverwell CL, Mutai NC, Forbes KM, Vapalahti O, et al. Predictive mapping of mosquito distribution based on environmental and anthropogenic factors in Taita Hills, Kenya. Int J Appl Earth Obs Geoinf. 2019;76: 84–92. doi: 10.1016/j.jag.2018.11.004 [DOI] [Google Scholar]
- 106.Merow C, Smith MJ, Silander JA Jr. A practical guide to MaxEnt for modeling species’ distributions: what it does, and why inputs and settings matter. Ecography (Cop). 2013;36: 1058–1069. doi: 10.1111/j.1600-0587.2013.07872.x [DOI] [Google Scholar]
- 107.Hijmans RJ, Graham CH. The ability of climate envelope models to predict the effect of climate change on species distributions. Glob Chang Biol. 2006;12: 2272–2281. doi: 10.1111/j.1365-2486.2006.01256.x [DOI] [Google Scholar]
- 108.Khatchikian C, Sangermano F, Kendell D, Livdahl T. Evaluation of species distribution model algorithms for fine-scale container-breeding mosquito risk prediction. Med Vet Entomol. 2011;25: 268–275. doi: 10.1111/j.1365-2915.2010.00935.x [DOI] [PMC free article] [PubMed] [Google Scholar]