Skip to main content
PLOS Biology logoLink to PLOS Biology
. 2020 Jun 25;18(6):e3000633. doi: 10.1371/journal.pbio.3000633

Mapping trends in insecticide resistance phenotypes in African malaria vectors

Penelope A Hancock 1,*, Chantal J M Hendriks 1, Julie-Anne Tangena 2, Harry Gibson 1, Janet Hemingway 2, Michael Coleman 2, Peter W Gething 3,4, Ewan Cameron 1, Samir Bhatt 5, Catherine L Moyes 1,*
Editor: Andrew Fraser Read6
PMCID: PMC7316233  PMID: 32584814

Abstract

Mitigating the threat of insecticide resistance in African malaria vector populations requires comprehensive information about where resistance occurs, to what degree, and how this has changed over time. Estimating these trends is complicated by the sparse, heterogeneous distribution of observations of resistance phenotypes in field populations. We use 6,423 observations of the prevalence of resistance to the most important vector control insecticides to inform a Bayesian geostatistical ensemble modelling approach, generating fine-scale predictive maps of resistance phenotypes in mosquitoes from the Anopheles gambiae complex across Africa. Our models are informed by a suite of 111 predictor variables describing potential drivers of selection for resistance. Our maps show alarming increases in the prevalence of resistance to pyrethroids and DDT across sub-Saharan Africa from 2005 to 2017, with mean mortality following insecticide exposure declining from almost 100% to less than 30% in some areas, as well as substantial spatial variation in resistance trends.


Malaria control efforts are threatened by the emergence of insecticide resistance in mosquito vector populations. This study provides fine-resolution maps of insecticide resistance levels in malaria vectors for large parts of Sub-Saharan Africa; the maps quantify recent spatial trends in resistance and can assist in the implementation of resistance management practices.

Introduction

Insecticide resistance in African malaria vector populations has serious consequences for malaria prevention. Long-lasting insecticide-treated nets (LLINs) have achieved substantial reductions in malaria prevalence thus far in Africa [1], but the number of insecticides currently available for use in LLINs is very limited. Until recently, pyrethroids were the only class approved for use in LLINs, and recently launched new generation nets still use pyrethroids in combination with either an insect growth regulator, a pyrrole, or a synergist that inhibits the primary metabolic mechanism of pyrethroid resistance within mosquitoes [2,3]. A wider range of options is available for indoor residual spraying (IRS), but pyrethroids are less expensive than many alternatives and are still used for IRS in malaria-endemic sub-Saharan African countries [4,5].

Although there is evidence that pyrethroid resistance in African malaria vector populations is increasing [6,7], the wide array of field studies that are available do not provide a spatially comprehensive time series of resistance trends [8]. Quantifying these trends will improve our understanding of the historical spread of resistance and assist in designing insecticide resistance management strategies [9]. Comprehensive spatiotemporal analyses of resistance are also necessary to facilitate its inclusion in epidemiological models of malaria that inform decision-making at national and global levels [9]. Efforts to estimate trends in insecticide resistance are impeded by limitations associated with the available observations of resistance phenotypes in field mosquito populations. Observations from standardized susceptibility tests, which indicate the prevalence of phenotypic resistance in field populations, cover a wide geographic area and span several decades [8,10]. However, the spatial coverage of these data is sparse and heterogeneous, and resistance has rarely been monitored consistently over time, meaning that very few time series are available [9]. Moreover, these susceptibility tests have a large measurement error, and replication is required to robustly estimate resistance phenotypes.

Several studies have reported spatial variation in resistance within a country, but these studies haven’t analysed the spatial trends behind this variation [11,12]. In addition, time series of resistance data have shown changes over time at the location of sentinel sites [1320]; however, only one study has investigated temporal trends across Africa [7]. This study generated a single trend over time for the whole continent, for each species and insecticide, and did not investigate differences in trends between locations. No previous study has analysed spatial variation across multiple countries or investigated spatiotemporal trends within these regions.

Our capacity to understand and predict insecticide resistance can benefit from considering the variables that may influence selection for resistance. Sources of insecticides in the environment include the application of insecticide-based vector control interventions for public health—such as LLINs and IRS—and the application of agricultural insecticides, which include the same insecticide classes as those used in vector control [21]. LLIN coverage increased markedly across Africa from 2005 in response the global Roll Back Malaria initiative [22, 23], while IRS coverage has been restricted to much smaller areas [4]. Early insecticide-treated nets (ITNs) used permethrin or deltamethrin, whereas α-cypermethrin is now the most commonly used pyrethroid in LLINs. Over the last 20 years, deltamethrin, λ-cyhalothrin, and DDT have been used for IRS, with α-cypermethrin first used in mass campaigns in 2003. Since 2015, deltamethrin has been the only pyrethroid reported to be used in mass IRS campaigns along with DDT and other non-pyrethroid insecticides [4]. Several studies have demonstrated a local increase in insecticide resistance in field mosquito populations following the implementation of LLINs, IRS, or both [19,20,2426] although in other locations evidence of higher resistance after the introduction of these interventions was not found [24,27]. Associations between agricultural pesticide use and insecticide resistance have also been found [21,28], and there is evidence that pesticide contamination of water bodies is a source of selection pressure for resistance acting on mosquito larvae [29]. Relationships between resistance and drivers of selection will, however, vary geographically depending on population structure [30,31]. Genetic mechanisms of resistance also differ across mosquito species [25,30], and even closely related mosquito species have different ecological niches [32,33], as well as different blood feeding behaviour and preferences, meaning that they are likely to experience differences in insecticide exposure [34].

Pyrethroid resistance has not spread outwards from a single origin, and there are several known resistance mechanisms. The mechanisms that have been identified can be classed into target site mutations, up-regulation of detoxifying enzymes, and cuticular thickening [3538]. Each class is associated with multiple genetic variants that confer resistance, and there is evidence that at least some of these have arisen independently multiple times across Africa [39]. Once a new variant has arisen, increases in its frequency are driven by selection pressures that differ from place to place, for example, through the spatially varying coverage of pyrethroid use in public health and agriculture [4,22,40]. The spread of these genetic variants to new locations is driven by gene flow that also varies with location [30,4145]. Here, we consider a species complex, and the “spread” of resistance is further influenced by introgression of resistance genes from one species to another [4651]. The geospatial patterns in the pyrethroid resistance phenotype that we are studying here are, therefore, the result of a complex combination of multiple origins of multiple genetic variants that have then spread through gene flow and introgression.

There is evidence that some mechanisms of pyrethroid resistance differ in the level of resistance they confer to different compounds; for example, different depletion rates were found across 6 pyrethroid compounds when they were metabolised by 8 Anopheles cytochrome P450 enzymes under controlled conditions [52]. These differences have not been investigated for all mechanisms of resistance, and there is no evidence that the differences seen for individual resistance mechanisms tested in the laboratory translate into diverging patterns of resistance in the field.

To develop predictive models of insecticide resistance in field populations that can represent variable, nonlinear interactions with environmental, biological, and genetic variables, we utilise an ensemble modelling approach. The approach exploits the multifaceted strengths of different modelling methodologies, using machine-learning methods to extract predictive power from a set of covariates and then allowing a Bayesian geostatistical Gaussian process to model the autocorrelated residual variation [53]. Bayesian geostatistical models provide a robust model of residual autocorrelation that can be applied to spatiotemporal data with a heterogeneous sampling distribution [54]. Their application to observations from insecticide susceptibility tests conducted over a range of locations across Africa has previously demonstrated broad-scale associations between resistance to different types of pyrethroids, as well as the organochlorine DDT [55]. The models developed in this study exploit these associations in resistance across different insecticides to improve resistance predictions for individual insecticide types.

Using a database containing the results of standard insecticide susceptibility tests performed on mosquito samples collected throughout Africa [8], we extracted the results of 6,423 tests conducted on samples from the A. gambiae species complex, which are among the most important African malaria vectors. We used this data set in our model ensemble to quantify variation in the prevalence of resistance to pyrethroids and DDT over the period 2005–2017 by developing a series of predictive maps. Our models are informed by a suite of potential explanatory variables describing the coverage of insecticide-based vector control interventions, agriculture and other types of land cover, climate, processes determining the environmental fate of pesticides, and the distribution of the sibling species that make up the A. gambiae complex. Our results show dramatic changes in insecticide resistance phenotypes in malaria vector populations across Africa over a 13-year period and identify variables that were important in shaping these predictions.

Results

Spatiotemporal trends in the prevalence of insecticide resistance

Pyrethroid resistance

We investigated spatiotemporal trends in the prevalence of phenotypic resistance in the A. gambiae complex to 4 pyrethroids: deltamethrin, permethrin, λ-cyhalothrin, and α-cypermethrin. Due to the lack of observations from central Africa, we partitioned the data into 2 separate spatial regions covering western and eastern parts of the continent and analysed each data subset independently by fitting separate models (Fig 1 and “Methods”). In West Africa, predicted mean prevalence of resistance to all pyrethroids increased dramatically over the period 2005–2017 (Figs 2 and 3 and S1 Fig, S2 Fig and S3 Fig). Predicted mean proportional mortality to deltamethrin was below 0.9 (the WHO threshold for confirmed resistance) across 15% (95% credible interval [CI] 13%–17%) of the west region in 2005, and across 98% (CI 96.6%–98.7%) of the region in 2017 (Fig 3). These changes in resistance were spatially heterogeneous (Fig 2). Increases in resistance to deltamethrin over the period—in terms of the reductions in the predicted mean proportional mortality—were greatest in northern Liberia (Fig 2D, line A); central Cote d’Ivoire (Fig 2D, line B); the area surrounding the border between Burkina Faso, Cote d’Ivoire, and Ghana (Fig 2D, line C); southern Ghana (Fig 2D, line D); and northern Gabon (Fig 2D, line E). In these regions, resistance to deltamethrin in 2017 was particularly high (with a mean proportional mortality below 0.3 (CI < 0.4).

Fig 1. The sampling distribution of the pyrethroid and DDT susceptibility test observations for mosquitoes from the A. gambiae species complex across space and time.

Fig 1

(A) The number of susceptibility test observations for each year and insecticide type. Numerical values are provided in S1 Data (10.6084/m9.figshare.9912623). (B) The sampling locations of the susceptibility test observations in panel A.

Fig 2. Predicted mean proportional mortality to deltamethrin across the west and east regions.

Fig 2

(A) 2005, (B) 2010, (C) 2015, and (D) 2017. See 10.6084/m9.figshare.9912623.

Fig 3.

Fig 3

The proportion of the area with a predicted mean mortality to deltamethrin of less than 0.9, for the west region (red line) and the east region (blue line). Red and blue shaded areas indicate the 95% CI of the predicted proportion of pixels for the west and east regions, respectively. Numerical values are provided in S2 Data (10.6084/m9.figshare.9912623). CI, credible interval.

In East Africa, the prevalence of pyrethroid resistance also increased over the period 2005–2017, albeit at a lesser rate than that in the west region (Figs 2 and 3). Predicted mean proportional mortality to deltamethrin was below 0.9 across 9% (CI 3%–17%) of the east region in 2005 and across 45% (CI 38%–51%) of the region in 2017 (Fig 3). The greatest increases in pyrethroid resistance over the period occurred in the northern part of the region, in the area from central Ethiopia (Fig 2D, line F) westward across most of South Sudan (Fig 2D, line G), and extending into southern Sudan (Fig 2D, line H) and northern Uganda (Fig 2D, line I). Across most of this area, mean mortality to deltamethrin in 2017 was below 0.5 (CI < 0.75). Resistance to deltamethrin increased to a lesser extent in central and southern Uganda, western Kenya, eastern Ethiopia, and coastal Tanzania, with predicted mean mortalities of between 0.6 and 0.8 in these areas in 2017. In areas farther south, differences in predicted resistance over the time period were relatively slight, with mean mortalities changing by less than 0.15 from 2005 to 2017 within Malawi, Mozambique, Zimbabwe, and those parts of Zambia, Botswana, and South Africa that were included in the model. Similar spatiotemporal trends across the west and east regions occurred in predicted mean resistance to permethrin, λ-cyhalothrin, and α-cypermethrin (S1 Fig, S2 Fig and S3 Fig).

DDT resistance

Predicted mean resistance to DDT at the start of the period (in 2005) was more widespread in comparison to pyrethroid resistance and also increased throughout the region from 2005 to 2017 (Figs 4 and 5). In the west region, predicted mean proportional mortality to DDT was below 0.9 across 53% (CI 47%–59%) of the west region in 2005, and across 97% (CI 92.7%–99%) of the region in 2017 (Fig 5). Increases in resistance to DDT over the period were greatest in the area surrounding the border between Liberia and Guinea (Fig 4D, line A), southern Mali (Fig 4D, line B), and central Burkina Faso (Fig 4D, line C). The east region showed a weaker increase in predicted mean resistance to DDT over the period 2005–2017 in comparison to that occurring in the west region. Predicted mean proportional mortality was below 0.9 across 32% (CI 21%–44%) of the east region in 2005, and across 45% (CI 39%–51%) in 2017. Increases in DDT resistance over the period were greatest in central South Sudan (Fig 4D, line D).

Fig 4. Predicted mean proportional mortality to DDT across the west and east regions.

Fig 4

(A) 2005, (B) 2010, (C) 2015, and (D) 2017. See 10.6084/m9.figshare.9912623.

Fig 5.

Fig 5

The proportion of the area with a predicted mean mortality to DDT of less than 0.9, for the west region (red line) and the east region (blue line). Red and blue shaded areas indicate the 95% CI of the predicted proportion of pixels for the west and east regions, respectively. Numerical values are provided in S2 Data (10.6084/m9.figshare.9912623). CI, credible interval.

Assessing prediction accuracy

We performed 10-fold out-of-sample validation on the model ensemble (re-running the model 10 times, withholding a different 10% portion of the data each time; see “Methods” and paper by Gelman and colleagues [56]) to assess the accuracy of predicted mean prevalence of resistance. Across all bioassay observations for pyrethroid insecticides, we obtained a root mean square error (RMSE) [57] of 0.179 (mean absolute error [MAE] = 0.127; S2 Table) across all the (out-of-sample) predictions of mean proportional mortality for the west and east regions combined (S5 Fig). Across all DDT bioassay observations, the corresponding out-of-sample RMSE was 0.167 (MAE = 0.111; S3 Table). We note that susceptibility tests have a high measurement error, which is reflected by the high values of the data noise parameter estimated by the fitted model [58] (S1 Table) and that the model aims to distinguish spatiotemporal variation in mean resistance that is independent of data noise.

The individual model constituents of our ensemble included 3 machine-learning models: an extreme gradient boosting model (XGB), a random forest model (RF), and a boosted generalized additive model (BGAM). We confirmed that our model ensemble showed improved predictive performance relative to each of these constituent models, as expected (S2 Table and S3 Table). Of the 3 machine-learning models, XGB had the lowest out-of-sample prediction error followed by RF and then BGAM. The fitted mean model weights given by the ensemble model were higher for models with lower out-of-sample prediction error, as expected (S4 Table).

In order to quantify prediction uncertainty across space and time, we produced maps of the 95% CIs of the posterior distributions of predicted mean mortality (Fig 6 and S7 Fig). We again performed 10-fold out-of-sample validation to assess the accuracy of the predicted CIs and found the coverage of the CIs to be accurate when the measurement error associated with the data—estimated by the fitted model [58]—was accounted for (S6 Fig). Prediction error is heterogeneous across space and time, with the 95% CIs of predicted mean mortality being higher in the east compared with the west region and with particularly high CIs in the northwestern part of the east region. This reflects the more sparse distribution of bioassay sampling locations in the east region, particularly in South Sudan and much of southern Sudan (Fig 1).

Fig 6. The prediction error (95% CI) associated with predicted mean mortality to deltamethrin.

Fig 6

See 10.6084/m9.figshare.9912623. CI, credible interval.

Regional and local resistance trends

National temporal trends in the predicted prevalence of resistance are qualitatively similar to those occurring in the broader east and west regions, particularly for countries in the west region, which all show a monotonic increase over time in the proportional area with a mean mortality to deltamethrin of <0.9 (Fig 7). Predicted temporal trends are more variable across the countries in the east region, with trends in Kenya, Malawi, Rwanda, and Tanzania suggesting either a plateau or a decline in resistance levels towards the end of the time period (Fig 7). Moreover, predicted temporal trends across point locations show greater variability than regional or national trends, and nonmonotonicity in point trends is common, with declines in resistance occurring following earlier increases (S8 Fig and S9 Fig). Attenuations and declines in resistance may reflect fitness costs of resistance, or they may also arise due to shifts in the composition of the sibling species that make up the A. gambiae complex, or immigration of mosquitoes from areas where resistance is lower (see “Discussion”).

Fig 7.

Fig 7

The proportion of the area with a predicted mean mortality to deltamethrin of less than 0.9 within each country that is fully contained within the west region (red lines) and the east region (blue lines). Red and blue shaded areas indicate the 95% CI of the predicted proportion of pixels for the west and east regions, respectively. Numerical values are provided in S3 Data and S4 Data (10.6084/m9.figshare.9912623). CI, credible interval.

Influential predictor variables

Our models used over 100 potential explanatory variables (see the “Methods” section), and our results show which of these variables were most influential to the predictions of mean prevalence of resistance. We obtained measures of variable importance for each of the 3 constituent machine-learning models (XGB, RF, and BGAM). Variable importance measures describe the influence of a variable on model predictions relative to the other predictor variables, but they can be hard to interpret when predictor variables are correlated (see S1 Text), and they do not identify causal relationships (see “Methods” and “Discussion”). For each model, the importance of each variable is expressed as a fraction of the total importance across all predictor variables. In ranking variable importance, we weighted the importance of each variable given by each model by the model’s weight obtained from the Gaussian process meta-model for pyrethroids (S4 Table). This increasingly weights those variables that were more important to models that performed better and thus made a higher relative contribution to the predictions made by the ensemble (Fig 8). Thus, the variable importance values given by XGB and RF are up-weighted relative to those given by BGAM. The original variable importance values produced by each model are given in S5 Table and S6 Table, and a description of each predictor variable is given in S9 Table.

Fig 8. Weighted variable importance of predictor variables given by the three machine-learning models included in the model ensemble.

Fig 8

(A) West Africa; (B) East Africa. Stacked bars show the relative variable importance given by XGB (blue), RF (green), and BGAM (grey), weighted by the fitted weight for each model given by the Gaussian process meta-model (see text). Variables are ranked by the total height of the stacked bars across the 3 models, and the top 20 variables are shown. The original variable importance values produced by each model are given in S5 Table and S6 Table, and definitions of each predictor variable are given in S9 Table. Variable name suffixes (-1), (-2), and (-3) denote time lags of 1, 2, and 3 years, respectively. One, two, and three asterisks denote the first, second, and third principal component, respectively, for variables available on a monthly time step (see “Methods”). BGAM, boosted generalized additive model; IRS, indoor residual spraying; ITN, insecticide-treated net; PET, potential evapotranspiration; RF, random forest model; XGB, extreme gradient boosting model.

For the west region, variables describing the coverage of ITNs had the highest importance value for each of the 3 models. For XGB and RF, the 3-year lag of ITN coverage had the highest importance value. For BGAM, non-lagged ITN coverage had the highest importance value, and the 3-year lag of ITN coverage had the second highest importance value (Fig 8 and S5 Table). Outside the top two, variables describing climate processes and those describing the area of harvested crops are highly ranked (within the top 20 most important variables) for all 3 models (Fig 8 and S5 Table). For the east region, variables describing ITN coverage and rainfall were ranked in the top 10 most important variables for all 3 models (Fig 8 and S6 Table). More broadly, variables describing climate processes were highly ranked by all 3 models. Our ability to quantitatively compare differences in importance across our set of predictor variables is, however, inhibited by differences in the definition of variable importance used in the different machine-learning approaches that we have employed (see “Methods”).

Discussion

Here, we have quantified spatial and temporal trends in insecticide resistance in the A. gambiae species complex in East and West Africa, showing marked increases in the prevalence of resistance to pyrethroids and DDT in recent years, as well as geographic expansion. These results highlight the urgency of identifying and implementing effective resistance management strategies. Our predictive maps of mean prevalence of resistance are available to visualise alongside the latest susceptibility test data on the insecticide resistance mapper website (http://www.irmapper.com) and can guide decisions about resistance management at regional and local levels. In making recommendations, our results will need to be considered in combination with (i) data from resistance monitoring of field samples, including other malaria vector species such as A. funestus; (ii) data on the presence of underlying mechanisms of resistance; and (iii) analyses of the expected impacts of resistance management strategies on malaria prevalence [9,59]. Decision-making frameworks also need to explicitly incorporate predictive uncertainty, which is facilitated by our out-of-sample validation results and our mapped Bayesian CIs. Our predictions are not a substitute for ongoing resistance monitoring requirements but highlight areas with particularly high levels of predictive uncertainty, such as parts of South Sudan, southern Sudan, and the Democratic Republic of Congo (Fig 6D). In these areas, field sampling to measure resistance is the only means of informing resistance management decisions.

Our results show substantial variation in resistance trends between East and West Africa, as well as within these two regions. Interestingly, ITN coverage was identified as a relatively influential predictor in our models, which is consistent with other studies that have found significant, but spatially variable, increases in pyrethroid resistance associated with the introduction of ITNs [24]. However, in several areas of the central and southern parts of East Africa, such as west Tanzania, ITN coverage has been relatively high (>50%) from 2012 to 2017 [22], but predicted pyrethroid resistance in 2017 was relatively low (Fig 2D). This may be influenced by the locations where resistance mechanisms first emerged, patterns of subsequent gene flow (including restricted flow across the Rift Valley) [41,60,61], and differences among the sibling species within the A. gambiae complex [25,30]. For example, the distribution of A. arabiensis extends further than other species in the complex [33], and this species is known to be more plastic in its feeding behaviour, biting outdoors and feeding on cattle [33]. Therefore, it is possible that selection for resistance in this species lags behind other members of the complex [6264]. Our predictions of the prevalence of resistance are based on susceptibility tests that often do not identify the sibling species composition of the A. gambiae complex sample that was tested. Our analysis only includes test results that are representative of the original sample collected [8, 55], and our predictions cannot directly represent variation in the prevalence of resistance due to variation in the composition of sibling species [33,65]. Routine identification of the composition of sibling species in tested samples—and the provision of species-specific mortality values—would improve the capacity of susceptibility test data to inform prediction of resistance.

The coverage of pyrethroid IRS was not among the most influential predictors in our models, but only a small fraction of the areas that we modelled (<5% of the west region and <15% of the east region) received pyrethroid IRS between 2005 and 2017 [4]. Thus, our results do not imply that IRS is not important in driving the selection of resistance. IRS can, however, be a useful tool to prevent the spread of resistance and mitigate its effects because the number of options available for IRS mean chemical classes can be rotated through time, applied in a mosaic in space, or combined for use in the same place and time [9].

It is also important to note that, while our models included over 100 potential predictor variables that may influence selection for resistance, it is unlikely that we have captured the full set of causal variables underlying selection. In particular, data on the quantities of insecticides used in agriculture, and where they were applied, were not available [66]. Such information would better inform predictive relationships between resistance and agricultural insecticide use. We note that the relationships between insecticide resistance and the predictor variables represented in our models do not prove causality. Each variable interacts with other variables (S10 Fig and S11 Fig) and possibly with variables not included in our analysis. For example, variables describing climate processes were ranked as influential predictors (Fig 8), but these may delineate broad areas where resistance trends are similar as a result of an unmeasured process, such as mosquito population structure or species composition. More extensive data on the presence of resistance mechanisms—including a wider coverage of voltage-gated sodium channel (Vgsc) allele frequencies, as well as metabolic resistance markers [67]—in field populations will aid in predicting and interpreting resistance trends. The similarity in predicted spatiotemporal patterns in resistance across the 4 pyrethroids and DDT (e.g., Figs 2 and 4) suggests common underlying resistance mechanisms [55].

In some areas, predicted temporal trends indicate plateaus or declines in the prevalence of resistance following an increase. Interestingly, the areas that showed the strongest interannual declines in resistance over the time period often experienced strong increases in earlier years (S9 Fig). These oscillatory dynamics may be the result of fitness costs associated with the introduction of novel resistance mutations, but the extent of such costs in field A. gambiae populations is poorly understood [68]. Several resistance mechanisms exist in field mosquito populations, and they are complex polygenic processes that vary geographically and are continuing to evolve [68,69]. Declines in the prevalence of resistance may also result from a shift in the composition of sibling species; for example, increases in the proportion of A. arabiensis have been observed following the instigation of LLIN interventions that predominantly target A. gambiae, which bite humans indoors [70]. A future focus on African regions where resistance levels are still relatively low may be deliver new insights into how spread is initiated and how it can be mitigated.

While our analysis focuses on pyrethroids, insecticides from other classes such as carbamates and organophosphates are being increasingly used in IRS interventions [4]. The number of available susceptibility test results for insecticides from these classes is relatively low [8], and spatiotemporal analyses of resistance would benefit greatly from increasing the frequency and spatial coverage of sampling and testing. Susceptibility test data are also more limited for A. funestus, a major malaria vector in Africa that is widespread and among the dominant vector species [71]. While the available susceptibility test data for A. funestus are insufficient to support a geospatial analysis of the kind performed here, we note that the data indicate higher levels of pyrethroid resistance than we have seen for A. gambiae s.l. in southern parts of East Africa (Fig 9).

Fig 9. The sampling distribution of the pyrethroid susceptibility test observations for A. funestus mosquito species across space and time.

Fig 9

(A) The proportional mortality to pyrethroids in susceptibility tests performed on A. funestus in the years 1998–2017 (n = 692). Raw data are available in Moyes et al. 2019 [8]. (B) the locations of the samples in panel A and the recorded proportional mortality.

In summary, our results provide an Africa-wide perspective on recent trends in pyrethroid and DDT resistance in A. gambiae complex malaria vectors, demonstrating increasingly high prevalence of resistance to the main insecticides used in malaria control. The rapid spread of resistance across large parts of sub-Saharan Africa signals an urgent need to quantify the efficacy of different resistance management strategies and to understand the impact of resistance on malaria transmission and control. Relationships between insecticide resistance and malaria prevalence are currently poorly understood, but there is evidence that resistance can reduce the efficacy of standard pyrethroid-treated LLINs [3], which have played a key role in achieving reductions in malaria prevalence in Africa over 2000–2015 [1]. Our maps show marked broad-scale spatial heterogeneity in resistance, motivating the implementation and assessment of a wide range of strategies that target different insecticide resistance and malaria transmission settings, such as next-generation LLINs [3,9] as well as rotating insecticide use across different insecticide classes [9].

Methods

Data

Insecticide resistance bioassay data

Insecticide resistance bioassay data were obtained from a published database [8], which is an updated version of the data used by Hancock and colleagues [55] that includes samples tested up until the end of 2017. The data record the number of mosquitoes in the sample and the proportional sample mortality resulting from the bioassay, as well as variables describing the mosquitoes tested, the sample collection site, and the bioassay conditions and protocol. We used this information to select a subset of records for inclusion in our study (S1 Text). In summary, we include bioassay results for which standard WHO susceptibility tests or CDC bottle bioassays using any one of the 4 pyrethroid types (deltamethrin, permethrin, λ-cyhalothrin, and α-cypermethrin) or the organochlorine DDT was performed on mosquito samples belonging to the A. gambiae species complex. We include results from bioassays conducted over the period 2005–2017. Due to spatial heterogeneity in the sampling distribution, we confine our analysis to samples collected from within 2 separate geographic (west and east) regions of sub-Saharan Africa (see Fig 1 and S1 Text). We excluded Madagascar from our analysis, as our models of resistance on the mainland may not generalize well to island populations. The final number of proportional mortality observations across all insecticide types was 6,423 across 1,466 locations, with 3,515 and 2,908 observations in the west region and east region, respectively (S7 Table and S8 Table).

Vgsc allele frequency data

The Vgsc is the target site for both pyrethroids and DDT, and mutations in this channel confer resistance. Our analysis used data on the frequency of Vgsc mutations in mosquito samples belonging to the A. gambiae species complex collected from within the west and east regions over the period 2005–2017 [8,55]. These data record the combined frequency of the single point mutations L1014F and L1014S with respect to the wild-type allele L1014L and comprise 316 observations (215 observations for the west region and 101 observations for the east regions; S7 Table). As described subsequently, we incorporated these data into machine-learning models in order to inform prediction of phenotypic resistance to DDT and pyrethroids by exploiting the positive association between the frequency of Vgsc mutations and the prevalence of these resistance phenotypes [55].

Potential predictor variables

Our set of predictors includes 111 variables describing environmental characteristics that could potentially be related to the development and spread of insecticide resistance in populations of A. gambiae complex mosquito species (described in S9 Table and S1 Text). These variables describe the coverage of insecticide-based vector control interventions, agricultural land use [72,73], and the environmental fate of agricultural insecticides [66], other types of land use [72,7476], climate [72,77,78], and relative species abundance. Our vector control intervention data include a variable estimating the yearly coverage of ITNs [22,79] and a variable estimating the coverage of IRS with either pyrethroids or DDT year [4]. Relative species abundance is represented by a variable estimating the abundance of A. arabiensis relative to the abundance of A. gambiae and A. coluzzii [65]. For all variables, we obtained spatially explicit data on a grid with a 2.5 arc-minute resolution (which is approximately 5 km at the equator) covering sub-Saharan Africa. For variables for which temporal data were available on an annual resolution, we included time-lagged representations with lags of 0, 1, 2, and 3 years.

Gaussian process stacked generalization ensemble modelling approach

Stacked generalization is a method of combining an ensemble of models to produce a meta-model, with the aim of achieving better predictive performance than the individual model constituents [80,81]. Here, we adopt a stacking design whereby a set of individual models that make up the first layer, referred to as the “level 0 models,” feed into a single meta-model on the second layer, referred to as the “level 1 model.” We use the Gaussian process stacked generalization approach developed by Bhatt and colleagues [53], which uses Gaussian process regression as the level 1 model that combines weighted out-of-sample predictions from a set of multiple level 0 models derived from machine-learning methods. The approach exploits the known strengths of these different methodologies, using machine-learning methods to extract as much predictive power from the covariates as possible, and then allowing the Gaussian process to model the spatiotemporal error covariance structure, aiming to further improve prediction. Bhatt and colleagues [53] showed that, under the (restrictive) assumption that the true function is a part of the model’s function space, the use of the Gaussian process model of residual variation improves prediction accuracy compared with a standard constrained weighted mean across the ensemble predictions.

Machine-learning models

Our set of level 0 models consists of 3 different types of machine-learning model that predict insecticide resistance, using our bioassay mortality observations as the label and our suite of intervention, agriculture, and environmental covariates as features. The machine-learning approaches employed include an XGB model (implemented using the R package xgboost), an RF model (implemented using the R package randomForest), and a BGAM model (implemented using the R package mboost). We chose these methods because of their demonstrated high predictive performance, particularly in previous applications of Gaussian process stacked generalization to spatial processes [53]. The label for the level 0 models was the proportional mortality observations from bioassays conducted using the 4 pyrethroid types (deltamethrin, permethrin, λ-cyhalothrin and α-cypermethrin), the proportional mortality observations for bioassays conducted using DDT, and the observations of the combined frequency of the Vgsc mutations L1014F and L1014S. We included in the label our data on the observed combined frequency of Vgsc mutations in mosquito samples, because these observations are significantly associated with the prevalence of resistance to DDT and pyrethroids [55] and can therefore inform prediction of these mortality values. Before performing parameter tuning on the level 0 models, we applied 2 data transformations to the label, the empirical logit transformation followed by the inverse hyperbolic sine (IHS) transformation [82].

The features used in the models included the 111 environmental predictor variables together with the 1-, 2-, and 3-year lags for those variables that vary temporally (on a yearly time step). A factor variable grouping the label according to the type of observation was also included as a feature, assigning a different group to bioassay observations depending on type of insecticide used and whether a WHO or CDC susceptibility test was used. This factor variable also assigned the Vgsc allele frequency observations to a separate group. Finally, the year in which the bioassay and allele frequency samples were collected was also included as a feature.

For each level 0 model, parameter tuning was performed using K-fold out-of-sample validation based on subdividing the data into K training and validation subsets (see S1 Text). In applying the XGB method, we used the DART boosting methodology to avoid overfitting [83].

Model stacking and Gaussian process regression

Let gA(si,t) denote the (empirical logit and IHS-transformed) proportional mortality record for a bioassay using insecticide type A conducted on a sample collected at geographic coordinates si and sampling time t. To implement Gaussian process stacked generalization, we model the transformed observations, denoted gA(si,t), using a Gaussian process regression formulation:

gA(sit)=wAMs,tA+fA(si,t)+eA (1)

in which wA is a constant vector, Ms,tA is a design matrix, fA(s,t) is a Gaussian process modelled by a spatiotemporal Gaussian Markov random field (GMRF) [84], and eA is Gaussian white noise N(0,σA2). We define a Bayesian hierarchical formulation for the model (Eq 1) using a vector of prior probability distributions for the hyperparameters θA = [wA,ψA,σA] in which ψA are the parameters of fA(s,t) (see S1 Text). To fit the model, the elements of the design matrix Ms,tA are set to the out-of-sample predictions of the level 0 models derived from K-fold cross-validation, i.e., Mi,pA=g˜A,p(sit), in which g˜A,p(sit) is the prediction of the iith withheld (transformed) observation gA(si,t) given by the pth level 0 model. Validation folds were randomly selected from the full data set. Posterior distributions of θA and fA(s,t) are then estimated by fitting the model (Eq 1) using the R-INLA package (www.r-inla.org) [85]. The posterior mean of the vector wA contains the fitted weights for each model, representing the relative contribution of each model to the predictions made by the model ensemble. Our implementation of Gaussian process regression (Eq 1) constrains each weight to be positive (wp≥0,∀p) [86]. Once the parameter estimation has been performed, the final set of predictions, g^A(s,t), given by the stacked model are obtained by replacing the elements of Ms,tA with the in-sample predictions of the l0 models obtained by fitting each of these models to all the data (all the labels and the corresponding sets of features) [53] (S1 Text).

Posterior validation

We performed posterior validation of the stacked model using 10-fold out-of-sample cross-validation, whereby the data were divided into 10 subsets (or “test” sets, using random sampling without replacement), and 10 successive model fits were performed, each withholding a different test set. Each test set was withheld from both the level 0 and level 1 models. We used these out-of-sample predictions to assess the accuracy of the predicted means of the observations as well as their predicted CIs (S1 Text). We also assessed the suitability of our assumed data-generating process using probability integral transform (PIT) histograms on out-of-sample data (S1 Text).

Predictor variable importance

We calculated measures of the importance of each predictor variable for each of the machine-learning models used in our model ensemble. For XGB, we used the gain measure calculated for each variable using the xgboost package [87], which is the fractional total reduction in the training error gained across all of that variable’s splits. For RF, we used the permutation importance measure calculated using the randomForest package [88], which is the fractional change in the out-of-bag error when the variable is randomly permuted. In the case of BGAM, we used the mboost package [89] to calculate variable importance as the total reduction in the training error across all boosting iterations in which that variable was chosen as the base learner. For each model, we express the importance of a single variable as a fraction of the total importance across all predictor variables in that model.

Code availability

R code for implementing the XGB, RF, and BGAM models, as well as the R-INLA models for Gaussian process stacked generalisation, is available on GitHub at 10.5281/zenodo.3751786 [90].

Supporting information

S1 Fig. Predicted mean proportional mortality to permethrin across the west and east regions.

(A) 2005, (B) 2010, (C) 2015, and (D) 2017. See 10.6084/m9.figshare.9912623.

(TIF)

S2 Fig. Predicted mean proportional mortality to λ-cyhalothrin across the west and east regions.

(A) 2005, (B) 2010, (C) 2015, and (D) 2017. See 10.6084/m9.figshare.9912623.

(TIF)

S3 Fig. Predicted mean proportional mortality to α-cypermethrin across the west and east regions.

(A) 2005, (B) 2010, (C) 2015, and (D) 2017. See 10.6084/m9.figshare.9912623.

(TIF)

S4 Fig. Histograms of the approximate cross-validated PIT values comparing observations and cumulative predictive densities across all susceptibility test observations for pyrethroids.

Numerical values are provided in S5 Data (10.6084/m9.figshare.9912623).

(TIF)

S5 Fig. Predictions of mean proportional mortality from 10-fold out-of-sample validation performed on the Gaussian process regression meta-model.

The vertical axis shows the corresponding value observed from the bioassay. Values for all data points for all pyrethroid types (deltamethrin, permethrin, λ-cyhalothrin, and α-cypermethrin) for the west region (red markers) and the east region (blue markers) are shown. The RMSE across all data values is 0.179 (RMSE = 0.191 for the data within the west region and RMSE = 0.166 for the data within the east region). Numerical values are provided in S6 Data (10.6084/m9.figshare.9912623). RMSE, root mean square error.

(TIF)

S6 Fig. The proportion of withheld data points that fell within the predicted CIs, based on 10-fold out-of-sample validation, when accounting for the estimated measurement error (see S1 Text).

Numerical values are provided in S7 Data (10.6084/m9.figshare.9912623). CI, credible interval.

(TIF)

S7 Fig. The prediction error (95% CI) associated with predicted mean mortality to DDT.

See 10.6084/m9.figshare.9912623. CI, credible interval.

(TIF)

S8 Fig

The predicted mean proportional mortality to deltamethrin over time for the point locations in the east (A) and west (B) regions that experienced the greatest overall increase in resistance from 2005 to 2017 (Fig 2; locations A, B, C, D, E, F, G, H, and I). Dashed lines show the 95% CIs of the predicted mean mortality. Numerical values are provided in S8 Data (10.6084/m9.figshare.9912623). CI, credible interval.

(TIF)

S9 Fig

The maximum interannual change in the predicted mean mortality to deltamethrin over the time period 2005–2017 at each location within the west and east regions: (A) the maximum interannual decrease, (B) the maximum interannual increase. Interannual increases and decreases in predicted mortality are calculated as the difference in predictions between 2 consecutive years, for all years 2005 to 2017. See 10.6084/m9.figshare.9912623.

(TIF)

S10 Fig. The Pearson correlation coefficient between each of the 20 variables with the highest weighted variable importance value for the models fitted to the West Africa data set.

(TIF)

S11 Fig. The Pearson correlation coefficient between each of the 20 variables with the highest weighted variable importance value for the models fitted to the East Africa data set.

(TIF)

S1 Table. Fitted parameters of the Bayesian Gaussian process regression models.

Numbers in brackets are the 95% CIs. CI, credible interval.

(DOCX)

S2 Table. The RMSE given by 10-fold out-of-sample validation performed on the Gaussian process regression meta-model fitted to the bioassay records for the four pyrethroid insecticides (deltamethrin, permethrin, λ-cyhalothrin, and α-cypermethrin) and each of the 3 machine-learning model constituents.

The unit of the transformed RMSE values corresponds to the (empirical logit and IHS-transformed) observations to which the models were fitted. IHS, inverse hyperbolic sine; RMSE, root mean square error.

(DOCX)

S3 Table. The RMSE given by 10-fold out-of-sample validation performed on the Gaussian process regression meta-model fitted to the bioassay records for DDT and each of the 3 machine-learning model constituents.

The unit of the transformed RMSE values corresponds to the (empirical logit and IHS-transformed) observations to which the models were fitted. IHS, inverse hyperbolic sine; RMSE, root mean square error.

(DOCX)

S4 Table. The fitted weights for each constituent model included in the Gaussian process regression meta-model.

(DOCX)

S5 Table. Variable importance values for predictor variables given by each machine-learning model included in the model ensemble for the west region.

The 30 variables that were most highly ranked by XGB are shown. Definitions of each predictor variable are given in S9 Table. Variable name suffixes (-1), (-2) and (-3) denote time lags of 1, 2, and 3 years, respectively. One, two, and three asterisks denote the first, second, and third principal component, respectively, for variables available on a monthly time step. XGB, extreme gradient boosting model.

(DOCX)

S6 Table. Variable importance values for predictor variables given by each machine-learning model included in the model ensemble for the east region.

The 30 variables that were most highly ranked by XGB are shown. Definitions of each predictor variable are given in S9 Table. Variable name suffixes (-1), (-2) and (-3) denote time lags of 1, 2, and 3 years, respectively. One, two and three asterisks denote the first, second, and third principal component, respectively, for variables available on a monthly time step. XGB, extreme gradient boosting model.

(DOCX)

S7 Table. Number of bioassay records for each insecticide type and number of Vgsc allele frequency observations.

(DOCX)

S8 Table. Number of bioassay records for each year for each insecticide class.

(DOCX)

S9 Table. Descriptions of each potential explanatory variable used in the ensemble model.

If the data layer was obtained from an online repository, the URL and date accessed are given. If the data layer has a citation, then this is given.

(DOCX)

S10 Table. l0 models and parameters.

(DOCX)

S1 Text. Supplementary information about the modelling methodology.

(PDF)

Acknowledgments

The authors are extremely grateful to the many people who contributed unpublished data sets and to the authors who provided additional information linked to their published works.

Abbreviations

BGAM

boosted generalized additive model

CI

credible interval

GMRF

Gaussian Markov random field

IHS

inverse hyperbolic sine

IRS

indoor residual spraying

ITN

insecticide-treated net

LLIN

long-lasting insecticide-treated net

MAE

mean absolute error

PET

potential evapotranspiration

PIT

probability integral transform

RF

random forest model

RMSE

root mean square error

Vgsc

Voltage-gated sodium channel

XGB

extreme gradient boosting model

Data Availability

The predictive maps of the mean prevalence of resistance are available to download from Figshare (10.6084/m9.figshare.9912623) and will be available to visualise on the Malaria Atlas Project website (https://map.ox.ac.uk/explorer/#). The susceptibility test data is available to download (https://doi.org/10.1101/582510 [8]). Sets of susceptibility test data and predictor variable data in the form used by the statistical modelling analyses are available from GitHub. Numerical data corresponding to the manuscript figures are available to download from Figshare (10.6084/m9.figshare.9912623). R code for implementing the extreme gradient boosting, random forest, and boosted generalized additive models and the R-INLA geostatistical models is available on GitHub at 10.5281/zenodo.3751786 (Hancock, 2020).

Funding Statement

This work was funded by Wellcome Trust Grant 108440/Z/15/Z (CLM). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Bhatt S, Weiss DJ, Cameron E, Bisanzio D, Mappin B, Dalrymple U, et al. The effect of malaria control on Plasmodium falciparum in Africa between 2000 and 2015. Nature. 2015;526(7572):207–2011. 10.1038/nature15535 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Corbel V, Chabi J, Dabire RK, Etang J, Nwane P, Pigeon O, et al. Field efficacy of a new mosaic long-lasting mosquito net (PermaNet (R) 3.0) against pyrethroid-resistant malaria vectors: a multi centre study in Western and Central Africa. Malar J. 2010;9 10.1186/1475-2875-9-113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Protopopoff N, Mosha JF, Lukole E, Charlwood JD, Wright A, Mwalimu CD, et al. Effectiveness of a long-lasting piperonyl butoxide-treated insecticidal net and indoor residual spray interventions, separately and together, against malaria transmitted by pyrethroid-resistant mosquitoes: a cluster, randomised controlled, two-by-two factorial design trial. Lancet. 2018;391(10130):1577–88. 10.1016/S0140-6736(18)30427-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Tangena J-A, Hendricks CJM, Devine M, Tammaro M, Trett AE, de Pina A, et al. Indoor residual spraying for malaria control in Sub-Saharan Africa 1997 to 2017: an adjusted retrospective analysis. Malar J. 2020;19:150 10.1186/s12936-020-03216-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Sherrard-Smith E, Griffin JT, Winskill P, Corbel V, Pennetier C, Djenontin A, et al. Systematic review of indoor residual spray efficacy and effectiveness against Plasmodium falciparum in Africa. Nat Commun. 2018;9 10.1038/s41467-018-07357-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Edi CAV, Koudou BG, Bellai L, Adja AM, Chouaibou M, Bonfoh B, et al. Long-term trends in Anopheles gambiae insecticide resistance in Cote d'Ivoire. Parasit Vectors. 2014;7 10.1186/s13071-014-0500-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Ranson H, Lissenden N. Insecticide resistance in African Anopheles mosquitoes: A worsening situation that needs urgent action to maintain malaria control. Trends Parasitol. 2016;32(3):187–96. 10.1016/j.pt.2015.11.010 [DOI] [PubMed] [Google Scholar]
  • 8.Moyes CL, Wiebe A, Gleave K, Trett A, Hancock PA, Padonou GG, et al. Analysis-ready datasets for insecticide resistance phenotype and genotype frequency in African malaria vectors. Sci Data. 2019;6(1):121 10.1038/s41597-019-0134-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.World Health Organization. Global Plan for Insecticide Resistance Management in Malaria Vectors. Geneva: World Health Organization, 2012. [Google Scholar]
  • 10.Coleman M, Hemingway J, Gleave KA, Wiebe A, Gething PW, Moyes CL. Developing global maps of insecticide resistance risk to improve vector control. Malar J. 2017;16 10.1186/s12936-017-1733-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Camara S, Koffi AA, Alou LPA, Koffi K, Kabran JPK, Kone A, et al. Mapping insecticide resistance in Anopheles gambiae (s.l.) from Cote d'Ivoire. Parasit Vectors. 2018;11 10.1186/s13071-017-2546-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Wat'senga F, Manzambi EZ, Lunkula A, Mulumbu R, Mampangulu T, Lobo N, et al. Nationwide insecticide resistance status and biting behaviour of malaria vector species in the Democratic Republic of Congo. Malar J. 2018;17 10.1186/s12936-018-2285-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Badolo A, Traore A, Jones CM, Sanou A, Flood L, Guelbeogo WM, et al. Three years of insecticide resistance monitoring in Anopheles gambiae in Burkina Faso: resistance on the rise? Malar J. 2012;11 10.1186/1475-2875-11-232 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Abdalla H, Wilding CS, Nardini L, Pignatelli P, Koekemoer LL, Ranson H, et al. Insecticide resistance in Anopheles arabiensis in Sudan: temporal trends and underlying mechanisms. Parasit Vectors. 2014;7 10.1186/1756-3305-7-213 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Kabula B, Tungu P, Malima R, Rowland M, Minja J, Wililo R, et al. Distribution and spread of pyrethroid and DDT resistance among the Anopheles gambiae complex in Tanzania. Med Vet Entomol. 2014;28(3):244–52. 10.1111/mve.12036 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Hakizimana E, Karema C, Munyakanage D, Iranzi G, Githure J, Tongren JE, et al. Susceptibility of Anopheles gambiae to insecticides used for malaria vector control in Rwanda. Malar J. 2016;15 10.1186/s12936-016-1618-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Foster GM, Coleman M, Thomsen E, Ranson H, Yangalbe-Kalnone E, Moundai T, et al. Spatial and temporal trends in insecticide resistance among malaria vectors in Chad highlight the importance of continual monitoring. PLoS ONE. 2016;11(5). 10.1371/journal.pone.0155746 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Messenger LA, Shililu J, Irish SR, Anshebo GY, Tesfaye AG, Ye-Ebiyo Y, et al. Insecticide resistance in Anopheles arabiensis from Ethiopia (2012–2016): a nationwide study for insecticide resistance monitoring. Malar J. 2017;16 10.1186/s12936-017-2115-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Mandeng SE, Awono-Ambene HP, Bigoga JD, Ekoko WE, Binyang J, Piameu M, et al. Spatial and temporal development of deltamethrin resistance in malaria vectors of the Anopheles gambiae complex from North Cameroon. PLoS ONE. 2019;14(2). 10.1371/journal.pone.0212024 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Ismail BA, Kafy HT, Sulieman JE, Subramaniam K, Thomas B, Mnzava A, et al. Temporal and spatial trends in insecticide resistance in Anopheles arabiensis in Sudan: outcomes from an evaluation of implications of insecticide resistance for malaria vector control. Parasit Vectors. 2018;11 10.1186/s13071-018-2732-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Chouaibou M, Etang J, Brevault T, Nwane P, Hinzoumbe CK, Mimpfoundi R, et al. Dynamics of insecticide resistance in the malaria vector Anopheles gambiae s.l. from an area of extensive cotton cultivation in Northern Cameroon. Trop Med Int Health. 2008;13(4):476–86. 10.1111/j.1365-3156.2008.02025.x [DOI] [PubMed] [Google Scholar]
  • 22.Bhatt S, Weiss DJ, Mappin B, Dalrymple U, Cameron E, Bisanzio D, et al. Coverage and system efficiencies of insecticide-treated nets in Africa from 2000 to 2017. eLife. 2015;4 10.7554/eLife.09672 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Noor AM, Mutheu JJ, Tatem AJ, Hay SI, Snow RW. Insecticide-treated net coverage in Africa: mapping progress in 2000–07. Lancet. 2009;373(9657):58–67. 10.1016/S0140-6736(08)61596-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Cook J, Tomlinson S, Kleinschmidt I, Donnelly MJ, Akogbeto M, Adechoubou A, et al. Implications of insecticide resistance for malaria vector control with long-lasting insecticidal nets: trends in pyrethroid resistance during a WHO-coordinated multi-country prospective study. Parasit Vectors. 2018;11 10.1186/s13071-018-3101-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Mathias DK, Ochomo E, Atieli F, Ombok M, Bayoh MN, Olang G, et al. Spatial and temporal variation in the kdr allele L1014S in Anopheles gambiae s.s. and phenotypic variability in susceptibility to insecticides in Western Kenya. Malar J. 2011;10 10.1186/1475-2875-10-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Vontas J, Grigoraki L, Morgan J, Tsakireli D, Fuseini G, Segura L, et al. Rapid selection of a pyrethroid metabolic enzyme CYP9K1 by operational malaria control activities. Proc Natl Acad Sci U S A. 2018;115(18):4619–24. 10.1073/pnas.1719663115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Rakotoson JD, Fornadel CM, Belemvire A, Norris LC, George K, Caranci A, et al. Insecticide resistance status of three malaria vectors, Anopheles gambiae (s.l.), An. funestus and An. mascarensis, from the south, central and east coasts of Madagascar. Parasit Vectors. 2017;10 10.1186/s13071-017-2336-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Reid MC, McKenzie FE. The contribution of agricultural insecticide use to increasing insecticide resistance in African malaria vectors. Malar J. 2016;15 10.1186/s12936-016-1162-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Hien AS, Soma DD, Hema O, Bayili B, Namountougou M, Gnankine O, et al. Evidence that agricultural use of pesticides selects pyrethroid resistance within Anopheles gambiae s.l. populations from cotton growing areas in Burkina Faso, West Africa. PLoS ONE. 2017;12(3). 10.1371/journal.pone.0173098 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Miles A, Harding NJ, Botta G, Clarkson CS, Antao T, Kozak K, et al. Genetic diversity of the African malaria vector Anopheles gambiae. Nature. 2017;552(7683):96–+. 10.1038/nature24995 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Weedall GD, Mugenzi LMJ, Menze BD, Tchouakui M, Ibrahim SS, Amvongo-Adjia N, et al. A cytochrome P450 allele confers pyrethroid resistance on a major African malaria vector, reducing insecticide-treated bednet efficacy. Sci Transl Med. 2019;11 10.1126/scitranslmed.aat7386 [DOI] [PubMed] [Google Scholar]
  • 32.Simard F, Ayala D, Kamdem GC, Pombi M, Etouna J, Ose K, et al. Ecological niche partitioning between Anopheles gambiae molecular forms in Cameroon: the ecological side of speciation. BMC Ecol. 2009;9 10.1186/1472-6785-9-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Wiebe A, Longbottom J, Gleave K, Shearer FM, Sinka ME, Massey NC, et al. Geographical distributions of African malaria vector sibling species and evidence for insecticide resistance. Malar J. 2017;16:85 10.1186/s12936-017-1734-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Massey NC, Garrod G, Wiebe A, Henry AJ, Huang Z, Moyes CL, et al. A global bionomic database for the dominant vectors of human malaria. Sci Data. 2016;3:160014 10.1038/sdata.2016.14 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Balabanidou V, Kefi M, Aivaliotis M, Koidou V, Girotti JR, Mijailovsky SJ, et al. Mosquitoes cloak their legs to resist insecticides. Proc R Soc Lond B Biol Sci. 2019;286(1907). 10.1098/rspb.2019.1091 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.David JP, Ismail HM, Chandor-Proust A, Paine MJI. Role of cytochrome P450s in insecticide resistance: impact on the control of mosquito-borne diseases and use of insecticides on Earth. Philos Trans R Soc Lond B Biol Sci. 2013;368(1612). 10.1098/rstb.2012.0429 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Liu NN. Insecticide Resistance in Mosquitoes: Impact, Mechanisms, and Research Directions. In: Berenbaum MR, editor. Annual Review of Entomology, Vol 60 Annual Review of Entomology. 602015. p. 537–59. 10.1146/annurev-ento-010814-020828 [DOI] [PubMed] [Google Scholar]
  • 38.Lynd A, Oruni A, van't Hof AE, Morgan JC, Naego LB, Pipini D, et al. Insecticide resistance in Anopheles gambiae from the northern Democratic Republic of Congo, with extreme knockdown resistance (kdr) mutation frequencies revealed by a new diagnostic assay. Malar J. 2018;17 10.1186/s12936-018-2561-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Lucas ER, Rockett KA, Lynd A, Essandoh J, Grisales N, Kemei B, et al. A high throughput multi-locus insecticide resistance marker panel for tracking resistance emergence and spread in Anopheles gambiae. Sci Rep. 2019;9 10.1038/s41598-019-49892-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Christiaensen L, Demery L, Dillon B, Barrett CB, Deininger KW, Kaminski J. Agriculture in Africa: telling myths from facts Washington D.C.: World Bank, 2018. [Google Scholar]
  • 41.Donnelly MJ, Simard F, Lehmann T. Evolutionary studies of malaria vectors. Trends Parasitol. 2002;18(2):75–80. 10.1016/s1471-4922(01)02198-5 [DOI] [PubMed] [Google Scholar]
  • 42.Lehmann T, Licht M, Elissa N, Maega BTA, Chimumbwa JM, Watsenga FT, et al. Population structure of Anopheles gambiae in Africa. J Hered. 2003;94(2):133–47. 10.1093/jhered/esg024 [DOI] [PubMed] [Google Scholar]
  • 43.Neafsey DE, Lawniczak MKN, Park DJ, Redmond SN, Coulibaly MB, Traore SF, et al. SNP genotyping defines complex gene-flow boundaries among African malaria vector mosquitoes. Science. 2010;330(6003):514–7. 10.1126/science.1193036 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Pinto J, Egyir-Yawson A, Vicente JL, Gomes B, Santolamazza F, Moreno M, et al. Geographic population structure of the African malaria vector Anopheles gambiae suggests a role for the forest-savannah biome transition as a barrier to gene flow. Evol Appl. 2013;6(6):910–24. 10.1111/eva.12075 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Barnes KG, Irving H, Chiumia M, Mzilahowa T, Coleman M, Hemingway J, et al. Restriction to gene flow is associated with changes in the molecular basis of pyrethroid resistance in the malaria vector Anopheles funestus. Proc Natl Acad Sci U S A. 2017;114(2):286–91. 10.1073/pnas.1615458114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Weetman D, Wilding CS, Steen K, Pinto J, Donnelly MJ. Gene flow-dependent genomic divergence between Anopheles gambiae M and S forms. Mol Biol Evol. 2012;29(1):279–91. 10.1093/molbev/msr199 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Lee Y, Marsden CD, Norris LC, Collier TC, Main BJ, Fofana A, et al. Spatiotemporal dynamics of gene flow and hybrid fitness between the M and S forms of the malaria mosquito, Anopheles gambiae. Proc Natl Acad Sci U S A. 2013;110(49):19854–9. 10.1073/pnas.1316851110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Clarkson CS, Weetman D, Essandoh J, Yawson AE, Maslen G, Manske M, et al. Adaptive introgression between Anopheles sibling species eliminates a major genomic island but not reproductive isolation. Nat Commun. 2014;5 10.1038/ncomms5248 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Ochomo E, Subramaniam K, Kemei B, Rippon E, Bayoh NM, Kamau L, et al. Presence of the knockdown resistance mutation, Vgsc-1014F in Anopheles gambiae and An. arabiensis in western Kenya. Parasit Vectors. 2015;8 10.1186/s13071-015-1223-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Caputo B, Pichler V, Mancini E, Pombi M, Vicente JL, Dinis J, et al. The last bastion? X chromosome genotyping of Anopheles gambiae species pair males from a hybrid zone reveals complex recombination within the major candidate 'genomic island of speciation'. Mol Ecol. 2016;25(22):5719–31. 10.1111/mec.13840 [DOI] [PubMed] [Google Scholar]
  • 51.Vicente JL, Clarkson CS, Caputo B, Gomes B, Pombi M, Sousa CA, et al. Massive introgression drives species radiation at the range limit of Anopheles gambiae. Sci Rep. 2017;7 10.1038/srep46451 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Yunta C, Hemmings K, Stevenson B, Koekemoer LL, Matambo T, Pignatelli P, et al. Cross-resistance profiles of malaria mosquito P450s associated with pyrethroid resistance against WHO insecticides. Pestic Biochem Physiol. 2019;161:61–7. 10.1016/j.pestbp.2019.06.007 [DOI] [PubMed] [Google Scholar]
  • 53.Bhatt S, Cameron E, Flaxman SR, Weiss DJ, Smith DL, Gething PW. Improved prediction accuracy for disease risk mapping using Gaussian process stacked generalization. Journal of the Royal Society Interface. 2017;14(134). 10.1098/rsif.2017.0520 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Blangiardo M, Cameletti M. Spatial and Spatio-Temporal Bayesian Models with R-INLA2015. 1–308 p.
  • 55.Hancock PA, Wiebe A, Gleave KA, Bhatt S, Cameron E, Trett A, et al. Associated patterns of insecticide resistance in field populations of malaria vectors across Africa. Proc Natl Acad Sci U S A. 2018;115(23):5938–43. 10.1073/pnas.1801826115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Gelman A, Hwang J, Vehtari A. Understanding predictive information criteria for Bayesian models. Stat Comput. 2014;24(6):997–1016. 10.1007/s11222-013-9416-2 [DOI] [Google Scholar]
  • 57.Gneiting T, Raftery AE. Strictly proper scoring rules, prediction, and estimation. J Am Stat Assoc. 2007;102(477):359–78. 10.1198/016214506000001437 [DOI] [Google Scholar]
  • 58.Lindgren F, Rue H, Lindstrom J. An explicit link between Gaussian fields and Gaussian Markov random fields: the stochastic partial differential equation approach. J R Stat Soc B. 2011;73:423–98. 10.1111/j.1467-9868.2011.00777.x [DOI] [Google Scholar]
  • 59.Churcher TS, Lissenden N, Griffin JT, Worrall E, Ranson H. The impact of pyrethroid resistance of the efficacy and effectiveness of bednets for malaria control in Africa. eLife. 2016;5 10.7554/eLife.16090 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Kamau L, Mukabana WR, Hawley WA, Lehmann T, Irungu LW, Orago AAS, et al. Analysis of genetic variability in Anopheles arabiensis and Anopheles gambiae using microsatellite loci. Insect Mol Biol. 1999;8(2):287–97. 10.1046/j.1365-2583.1999.820287.x [DOI] [PubMed] [Google Scholar]
  • 61.Lehmann T, Blackston CR, Besansky NJ, Escalante AA, Collins FH, Hawley WA. The Rift Valley complex as a barrier to gene flow for Anopheles gambiae in Kenya: The mtDNA perspective. J Hered. 2000;91(2):165–8. 10.1093/jhered/91.2.165 [DOI] [PubMed] [Google Scholar]
  • 62.Gericke A, Govere JM, Durrheim DN. Insecticide susceptibility in the South African malaria mosquito Anopheles arabiensis (Diptera: Culicidae). S Afr J Sci. 2002;98(3–4):205–8. [Google Scholar]
  • 63.Killeen GF, Kiware SS, Okumu FO, Sinka ME, Moyes CL, Massey NC, et al. Going beyond personal protection against mosquito bites to eliminate malaria transmission: population suppression of malaria vectors that exploit both human and animal blood. BMJ Glob Health. 2017;2(2). 10.1136/bmjgh-2016-000198 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Protopopoff N, Matowo J, Malima R, Kavishe R, Kaaya R, Wright A, et al. High level of resistance in the mosquito Anopheles gambiae to pyrethroid insecticides and reduced susceptibility to bendiocarb in north-western Tanzania. Malar J. 2013;12 10.1186/1475-2875-12-149 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Sinka ME, Golding N, Massey NC, Wiebe A, Huang Z, Hay SI, et al. Modelling the relative abundance of the primary African vectors of malaria before and after the implementation of indoor, insecticide-based vector control. Malar J. 2016;15 10.1186/s12936-016-1187-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Hendriks CJM, Gibson H, Trett A, Python A, Weiss DJ, Vrieling A, et al. Mapping geospatial processes affecting the environmental fate of agricultural pesticides in Africa. Int J Environ Res Public Health. 2019;16(3523): 10.3390/ijerph16193523. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Weetman D, Wilding CS, Neafsey DE, Muller P, Ochomo E, Isaacs AT, et al. Candidate-gene based GWAS identifies reproducible DNA markers for metabolic pyrethroid resistance from standing genetic variation in East African Anopheles gambiae. Sci Rep. 2018;8 10.1038/s41598-018-21265-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Lucas ER, Miles A, Harding NJ, Clarkson CS, Lawniczak MKN, Kwiatkowski DP, et al. Whole-genome sequencing reveals high complexity of copy number variation at insecticide resistance loci in malaria mosquitoes. Genome Res. 2019;29(8):1250–61. 10.1101/gr.245795.118 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.The Anopheles gambiae 1000 Genomes Consortium. Genetic diversity of the African malaria vector Anopheles gambiae. Nature. 2017;96(552). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Bayoh MN, Mathias DK, Odiere MR, Mutuku FM, Kamau L, Gimnig JE, et al. Anopheles gambiae: historical population decline associated with regional distribution of insecticide-treated bed nets in western Nyanza Province, Kenya. Malar J. 2010;9 10.1186/1475-2875-9-62 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Sinka ME, Bangs MJ, Manguin S, Rubio-Palis Y, Chareonviriyaphap T, Coetzee M, et al. A global map of dominant malaria vectors. Parasit Vectors. 2012;5 10.1186/1756-3305-5-69 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Friedl M, Sulla-Menashe D. MCD12Q1 MODIS/Terra+Aqua Land Cover Type Yearly L3 Global 500m SIN Grid V006. NASA EOSDIS Land Processes DAAC; 2015. [Google Scholar]
  • 73.You L, Wood-Sichra U, Fritz S, Guo Z, See L, Koo J. Spatial production allocation model (SPAM) 2005 v2.0: mapspam.info. Available from: mapspam.info.
  • 74.Tatem AJ. WorldPop, open data for spatial demography. Sci Data. 2017;4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Sulla-Menashe D, Gray JM, Abercrombie SP, Friedl MA. Hierarchical mapping of annual global land cover 2001 to present: The MODIS Collection 6 Land Cover product. Remote Sensing of Environment. 2019;222:183–94. 10.1016/j.rse.2018.12.013 [DOI] [Google Scholar]
  • 76.Esch T, Heldens W, Hirner A, Keil M, Marconcini M, Roth A, et al. Breaking new ground in mapping human settlements from space—The Global Urban Footprint. ISPRS J Photogramm Remote Sens. 2017;134:30–42. 10.1016/j.isprsjprs.2017.10.012 [DOI] [Google Scholar]
  • 77.Funk C, Peterson P, Landsfeld DP, Verdin J, Shukla S, Husak G, et al. The climate hazards infrared precipitation with stations—a new environmental record for monitoring extremes. Sci Data. 2 10.1038/sdata.2015.66 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Trabucco A, Zomer RJ. Global Aridity Index (Global-Aridity) and Global Potential Evapo-Transpiration (Global-PET) Geospatial Database. CGIAR-CSI GeoPortal; 2009. [Google Scholar]
  • 79.Weiss DJ, Lucas TCD, Nguyen M, Nandi A, Bisanzio D, Battle KE, et al. The global landscape of Plasmodium falciparum prevalence, incidence and mortality 2000–2017. Lancet. 2019;accepted. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Ting KM, Witten IH. Stacked generalization: when does it work? In: Pollack ME, editor. Ijcai-97—Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, Vols 1 and 2. International Joint Conference on Artificial Intelligence; 1997. p. 866–71. [Google Scholar]
  • 81.Wolpert DH. Stacked generalization. Neural Networks. 1992;5(2):241–59. 10.1016/s0893-6080(05)80023-1 [DOI] [Google Scholar]
  • 82.Burbidge JB, Magee L, Robb AL. Alternative transformations to handle extreme values of the dependent variable. J Am Stat Assoc. 1988;83(401):123–7. 10.2307/2288929 [DOI] [Google Scholar]
  • 83.Vinayak RK, Gilad-Bachrach R. DART: Dropouts meet Multiple Additive Regression Trees In: Guy L, Vishwanathan SVN, editors. Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics; Proceedings of Machine Learning Research: PMLR; 2015. p. 489–97. [Google Scholar]
  • 84.Cameletti M, Lindgren F, Simpson D, Rue H. Spatio-temporal modeling of particulate matter concentration through the SPDE approach. Asta-Advances in Statistical Analysis. 2013;97(2):109–31. 10.1007/s10182-012-0196-3 [DOI] [Google Scholar]
  • 85.Rue H, Held L. Gaussian Markov Random Fields: Theory and Applications. Boca Raton, FL: Chapman & Hall. [Google Scholar]
  • 86.Breiman L. Stacked regressions. Machine Learning. 1996;24(1):49–64. [Google Scholar]
  • 87.Chen T, Guestrin C. XGBoost: A scalable tree boosting system Proceedings of the 22nd ACM SIGDD International Conference on Knowledge Discovery and Data Mining. KDD ‘16. New York, NY, USA: ACM; 2016. p. 785–94. [Google Scholar]
  • 88.Liaw A, Wiener M. Classification and regression by randomForest. R News. 2002;2(3):18–22. [Google Scholar]
  • 89.Hothorn T, Buehlmann P, Kneib T, Schmid M, Hofner B. mboost: Model-Based Boosting. R package version 2.9–1 ed. https://CRAN.R-project.org/package = mboost: R package version 2.9–1; 2018. [Google Scholar]
  • 90.Hancock PA. pahanc/Mapping-insecticide-resistance: Insecticide resistance model ensemble. 2020. 10.5281/zenodo.3751786 [DOI] [Google Scholar]

Decision Letter 0

Lauren A Richardson

31 Dec 2019

Dear Dr Hancock,

Thank you for submitting your manuscript entitled "MAPPING TRENDS IN INSECTICIDE RESISTANCE PHENOTYPES IN AFRICAN MALARIA VECTORS" for consideration as a Research Article by PLOS Biology.

Your manuscript has now been evaluated by the PLOS Biology editorial staff as well as by an academic editor with relevant expertise and I am writing to let you know that we would like to send your submission out for external peer review.

However, before we can send your manuscript to reviewers, we need you to complete your submission by providing the metadata that is required for full assessment. To this end, please login to Editorial Manager where you will find the paper in the 'Submissions Needing Revisions' folder on your homepage. Please click 'Revise Submission' from the Action Links and complete all additional questions in the submission questionnaire.

Please re-submit your manuscript by Jan 07 2020 11:59PM.

Login to Editorial Manager here: https://www.editorialmanager.com/pbiology

During resubmission, you will be invited to opt-in to posting your pre-review manuscript as a bioRxiv preprint. Visit http://journals.plos.org/plosbiology/s/preprints for full details. If you consent to posting your current manuscript as a preprint, please upload a single Preprint PDF when you re-submit.

Once your full submission is complete, your paper will undergo a series of checks in preparation for peer review. Once your manuscript has passed all checks it will be sent out for review.

***Please be aware that, due to the voluntary nature of our reviewers and academic editors, manuscripts may be subject to delays due to their limited availability during the holiday season. Please also note that the journal office will be closed entirely 21st- 29th December inclusive, and 1st January 2020. Thank you for your patience.***

Feel free to email us at plosbiology@plos.org if you have any queries relating to your submission.

Kind regards,

Lauren A Richardson, Ph.D

Senior Editor

PLOS Biology

Decision Letter 1

Lauren A Richardson

5 Feb 2020

Dear Dr Hancock,

Thank you very much for submitting your manuscript "Mapping Trends In Insecticide Resistance Phenotypes In African Malaria Vectors" for consideration as a Research Article at PLOS Biology. Your manuscript has been evaluated by the PLOS Biology editors, an Academic Editor with relevant expertise, and by several independent reviewers.

As you will read, the reviewers appreciated many aspects of your work and the importance of these maps. They also raise a number of points that will need to be addressed in a revision. Of particular note, we like the suggestion of Rev #2 to add the data of An.funestus resistance, if it is available. Rev #3 also raises a number of excellent points. We agree that these type of additional analyses would improve the manuscript, but do not consider them to be absolutely essential for a revision. We would like you to consider them and address them as best you can, providing clear explanation for those analyses you do not choose to include in the revision.

In light of the reviews (below), we will not be able to accept the current version of the manuscript, but we would welcome re-submission of a much-revised version that takes into account the reviewers' comments. We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent for further evaluation by the reviewers.

We expect to receive your revised manuscript within 2 months.

Please email us (plosbiology@plos.org) if you have any questions or concerns, or would like to request an extension. At this stage, your manuscript remains formally under active consideration at our journal; please notify us by email if you do not intend to submit a revision so that we may end consideration of the manuscript at PLOS Biology.

**IMPORTANT - SUBMITTING YOUR REVISION**

Your revisions should address the specific points made by each reviewer. Please submit the following files along with your revised manuscript:

1. A 'Response to Reviewers' file - this should detail your responses to the editorial requests, present a point-by-point response to all of the reviewers' comments, and indicate the changes made to the manuscript.

*NOTE: In your point by point response to the reviewers, please provide the full context of each review. Do not selectively quote paragraphs or sentences to reply to. The entire set of reviewer comments should be present in full and each specific point should be responded to individually, point by point.

You should also cite any additional relevant literature that has been published since the original submission and mention any additional citations in your response.

2. In addition to a clean copy of the manuscript, please also upload a 'track-changes' version of your manuscript that specifies the edits made. This should be uploaded as a "Related" file type.

*Re-submission Checklist*

When you are ready to resubmit your revised manuscript, please refer to this re-submission checklist: https://plos.io/Biology_Checklist

To submit a revised version of your manuscript, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' where you will find your submission record.

Please make sure to read the following important policies and guidelines while preparing your revision:

*Published Peer Review*

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details:

https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/

*PLOS Data Policy*

Please note that as a condition of publication PLOS' data policy (http://journals.plos.org/plosbiology/s/data-availability) requires that you make available all data used to draw the conclusions arrived at in your manuscript. If you have not already done so, you must include any data used in your manuscript either in appropriate repositories, within the body of the manuscript, or as supporting information (N.B. this includes any numerical values that were used to generate graphs, histograms etc.). For an example see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5

*Blot and Gel Data Policy*

We require the original, uncropped and minimally adjusted images supporting all blot and gel results reported in an article's figures or Supporting Information files. We will require these files before a manuscript can be accepted so please prepare them now, if you have not already uploaded them. Please carefully read our guidelines for how to prepare and upload this data: https://journals.plos.org/plosbiology/s/figures#loc-blot-and-gel-reporting-requirements

*Protocols deposition*

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosbiology/s/submission-guidelines#loc-materials-and-methods

Thank you again for your submission to our journal. We hope that our editorial process has been constructive thus far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Lauren A Richardson, Ph.D

Senior Editor

PLOS Biology

*****************************************************

REVIEWS:

Reviewer #1:

I really enjoyed reading the manuscript MAPPING TRENDS IN INSECTICIDE RESISTANCE PHENOTYPES IN AFRICAN MALARIA VECTORS from Hancock and colleagues. It is well written (although I would spell out all the acronyms, since some reader may not be familiar with this topic) and the methods scientifically sound. The algorithm development and properties are coherent with the aim of the project, i.e. maximize the predictive capacity of the model (an ensemble). I congratulate the authors for the effort and for providing detailed information about the model behavior and predictions.

The paper is in line with PLOS Biology scope and aims and I suggest acceptance after the authors revise it in accordance to my corrections/questions.

Raised points:

Abstract.

Is it possible to quantify the "alarming increase", e.g. "since 2005 we have shown a X fold increase in malaria resistance across…"

Introduction

Line 74 should be "after the introduction of these interventions".

The passage from the paragraph in lines 63-83 to the paragraph in lines 85-98 is a bit abrupt. Is it possible to add a paragraph on how this problem (mapping insecticide resistance) has been dealt before? Also how current models fails in achieving what you have achieved.

Results

If I was a general reader, I would found very difficult the interpretation of the RMSE for the comparison of the 4 models. First of all, in table S2 and S3 keep consistency with names (GAMB or BGAM). In terms of values it is not clear to me why you use different units between the main text and the supplementary. If necessary, please add a sentence in the main text.

If I compare figure 1 and 5, it seems to me that lines F,G,H,I and relative areas (figure 1) may not conclusive since uncertainty is close to 1 or very high (Figure 5). Please comment in the discussions or provide a bit more information. In addition, looking at table S7 and S8 I speculate that uncertainty increases although samples are increasing too. Is this the effect of interventions that by fragmenting the "continuous" resistance, reduces the capacity of the spatio-temporal dependence to explain part of the variation?

I found very interesting the importance of rainfall in your models, especially for East Africa which was affected by two important droughts during your study period. Please can you add a comment on this?

Methods

Variable importance. Instead of using different functions (and packages) to evaluate variable importance for each model, it would not be easier to just remove a variable at time and look at the RMSE?

----------------

Reviewer #2:

Review PBIOLOGY-D-19-03626R1

The paper describes the spatial and temporal trend of insecticide resistance phenotypes in West and East African region using a large database (Moyes CL et al, Scientific dada, 2019) available in open access. Report of insecticide resistance increase in Africa is not new and several others authors have reported this trend in Africa. What is novel in the paper is the visualisation of this increase in resistance over time and space and differences between the West and East block of countries. The paper also explores and ranks factors that predicts the prevalence of insecticide resistance. This is a clear, concise and a well written manuscript of great importance.

I have however some major comments

* The results for East Africa is not as convincing than the West Africa one, as the authors omit An.funestus resistance that has a major roles in the Southern part of the continent but is also in the past few years increasingly present in East Africa countries (Burundi, Tanzania, Uganda, Kenya). The An.funestus resistance data are also available and applying the model would provide important information that are missing in the actual paper especially as the models aimed to assist decision making in resistance management. The frequency of insecticide resistance in East Africa is not as widespread than in West Africa and resistance management in this area might be the most successful.

* The contribution of public health vs agriculture insecticides to selection of resistance is essential to device appropriate resistance management. One of the main limitation concerning the predictors is the lack of information on the insecticide/pesticide use for agriculture. The authors used different variables as a proxy, land use of crops and other land cover, fate of insecticides in the environment, etc. These variables correspond to more than half of the variables included in the model. Variables that cover the widest area modelled (ITN, Rainfall, temperature etc..) seems to be the highest contributors. I am therefore wondering if the apparent lower contribution of agriculture pesticide in An. gambiae insecticide resistance is due to the number of variables that have been included. Could the land use variables representing crops using pesticides the most, be pulled together as a single variable in the model? While I understand the difficulty to get the data on agriculture pesticide and find appropriate proxies, the authors should discuss, more in details, some of their findings and differences observed between West and East Africa. The model seems to suggest that agriculture pesticide use is a more important contributor in West than East Africa. Please comments.

* The authors have discussed the difference in insecticide resistance between An.gambiae and An.arabiensis. I have 4 points, I wanted to raise on this.

o The prediction model gives a good idea of the trends in West Africa as well as predicting factors. The resistance trend in the vectors in West Africa (gamiae/coluzzi) is more homogenous than in the other sibling species in East Africa Arabiensis/gambiae. Could it explain the wider credible interval in the prediction they found in East compare to West Africa?

o The authors include a variable on relative abundance of Arabiensis vs gambiae/coluzzi. which do not seem to contribute to the overall model in East Africa. Please would you be able to explain.

o There are reports showing a shift toward An.arabiensis that seems to be associated with IRS and ITN. Would this and lower insecticide resistance prevalence in An.arabiensis explained why ITN coverage is not as strongly predictive for resistance in East Africa compared to West Africa?

o Contribution of Rainfall variables, please could you explain why do you think there is a difference in importance between West and East Africa. Could rainfall be a proxy for increase in An.gambiae abundance and therefore insecticide resistance? Or?

* Overall, I would suggest the authors to expand on their discussions especially on the contribution factors and possible explanation of differences between West and East Africa.

Minor comments,

* Why was the variable "Proportional abundance of An. arabiensis to An.coluzzii/gambiae" considered as static? The authors indicated this variable was based on collection from1985 -2015. Would the variable not be able to predict better the resistance if it was included as a yearly or bi-annual variable?

* The authors mentioned that insecticide use for agriculture was not available therefore different crops and livestock production were used. How were these 30 crops selected? Is it based on known insecticide use?

* The authors indicate that "Our predictive maps of mean prevalence of resistance are available to visualise alongside the latest susceptibility test data on the IR mapper website (http://www.irmapper.com), and can guide decisions about resistance management at regional and local levels. In making recommendations, our results will need to be considered in combination with (i) data from resistance monitoring of field samples, including other malaria vector species such as An. funestus; (ii) data on the presence of underlying mechanisms of resistance, and (iii) analyses of the expected impacts of resistance management strategies on malaria prevalence".

Would the maps and predictors be available for each country? Otherwise I am not convinced that the overall maps and results as presented in the paper would be enough to help decision making at national level.

----------------

Reviewer #3: Tovi Lehmann, signed review

Mapping Trends in Insecticide resistance Phenotypes in Africa malaria vectors

By: Hancock et al.

Summary:

This is a well written, interesting paper that I enjoyed reading. It is based an analytic modeling to describe the change of resistance to pyrethroids in the African malaria mosquito, A. gambiae s.l.. Using ~6500 test of mosquito resistance assays performed between 2005 and 2017 and incorporating >100 explanatory variables including application of the pesticides, cross-resistance between insecticide types (including DDT), climate, hydrology, vegetation cover, mosquito species composition, etc., the model generated predictive dynamic (time stamped) maps that describe the level of resistance across large parts of West and East Sub-Saharan Africa. Consistent with other studies, the results confirmed that resistance to Pyrethroids dramaticall increased (from 15% to 98%) over that time in W Africa, but from 9 to 45% in E Africa. Resistance to DDT also increased following similar pattern but to a lesser extent, given the higher starting rates of resistance to DDT in the area. The authors also assess the factors that explain the model and thus likely to affect the evolution of resistance. This analysis highlights the complexity of the factors and difficulty in narrowing the number of key factors down. Among the many effects, coverage of ITNs was highest in W Africa although it was modestly important (and ranked 8th) in E Africa. Yet, it appears that the success of the model depends on small effects of >100 factors or nearly so. The main strength of the study is the production of regional maps that can guide decisions about the management of resistance in areas where few or no tests of resistance were done. However, the authors caution that their predictions are no replacement for such tests and mostly call attention to low confidence indicator for areas where more tests are needed.

I applaud the authors for a valuable advance in the management of insecticide resistance and their application of state-of-the-art computing to generate these impressive maps. The careful interpretation of the results is also a strength of this paper. Yet, I feel that this paper better fits a journal with more specialized readership in a narrower field of malaria epidemiology than PlosBiology. I note below that if the authors extend their analyses to address additional new questions, then their work could well fit with PlosBiology's broad readership. I hope my comments below will help the authors decide about the best course for their paper.

General Comments:

1. Given the exceptional analytic skills and unique data, I wonder if the authors would focus on some additional questions (using regions/time sections that are suitable to address them as applicable) that will "elevate the interest" in their work. For example (a) Are true "reversals of resistance" evident? How to discern true reversals? If observed, how common are they? What factors may account for them? Are they more common during early evolution phase (when mutations are presumably more costly) and how they can be exploited in management of resistance?

b) are there islands of susceptibility and resistance that are stable. If so where, how long, and what could account for them (see above).

c) is there a way to infer the "lower rate of spatial spread of resistance" (assuming arriving from nearest area) over distance and possibly certain barriers?

d) how unstable are resistance profiles at a location, so if we have several resistance tests in a region that had indicated 51-67% 2,3,4, or up to 7 years ago, how important is to do that test this year if there is no dramatic change in insecticide use and if there was. How such answers change based on availability of a complete time series 10, 50, 100, or 500 km away. So, can we refine the guidance when and where we need new tests.

2. The term "spread of resistance" is complex and may better replaced here or be explained in the outset. Intuitively, it conveys the notion of (1) spread from point A to B, which is not the case here. The authors include (2) the rise in frequency of resistance in the same region, and (3) the spread from unknown multiple sources so that no point A to B can be given. A clarification would be helpful.

3. Maps are very good to convey "big picture" of changes in resistance level over time but seem a little too perfect and convey a false notion that all is known. I cannot tell which parts of a map are close to "observed", which show gaps between observed and predicted (how big the gap), and which are "predicted" with no observed to compare with.

These are a few examples for questions that once being addressed may provide new insights that will merit publication in a broad audience journal such as PlosBiology.

Specific comments

Introduction

- why the range of data used/selected was 2005-17? Given that there is much widespread resistance in 2005, it would be interesting to start earlier, even if it would address smaller areas.

-. It is mentioned that pyrethroids have been extensively used and given the analysis by different types of pyrethroids, it will help knowing about the difference in sensitivity for each type, mechanism of resistance to each type, and historical use of these types in different areas. Some of this info can form a basis for expectations to judge the maps against.

- The introduction led me to assume that there were no previous papers on the origin and spread of mechanisms of resistance which were depicted by maps. If this is incorrect, it'd be helpful to refer to their maps and underlying basis.

Results

I believe if date-stamp is 2005, data were taken only during this year (not from a range, e.g., 2004-2006). Correct?

L 198 -214. -

- I assume 'out-of-sample' means removing an observation and using the rest of the data to predict it. Is 10-fold means that it was repeated 10 times across all observation, one at a time or 90% of the data were used? It needs to be explained.

- I believe that RMSE should be given side by side with MAE, which is more directly applicable to the data and is less susceptible to outliers.

- I think this stat is more meaningful conditioned on distance (spatial and temporal if there were multiple observations in the time interval and only 1 was removed) from nearest data points. Further, I'd report the 'adjusted' value to reflect the mean and median distance between observed and predicted that the map shows.

- I agree that validation of model prediction is vital and fit in the results, although this section is written too technically, and most readers won't be able to interpret it. I suggest keeping the section but transfer parts to the methods and in their place, provide biological relevant predictions capturing a couple of the better and worse predictions pertaining to relevant space and time points.

Decision Letter 2

Roland G Roberts

31 Mar 2020

Dear Dr Hancock,

Thank you for submitting your revised Research Article entitled "MAPPING TRENDS IN INSECTICIDE RESISTANCE PHENOTYPES IN AFRICAN MALARIA VECTORS" for publication in PLOS Biology. I have now obtained advice from two of the original reviewers and have discussed their comments with the Academic Editor.

Based on the reviews, we will probably accept this manuscript for publication, assuming that you will modify the manuscript to address the remaining points raised by reviewer #3. Please also make sure to address my Data Policy-related requests noted at the end of this email.

We expect to receive your revised manuscript within two weeks. Your revisions should address the specific points made by each reviewer. In addition to the remaining revisions and before we will be able to formally accept your manuscript and consider it "in press", we also need to ensure that your article conforms to our guidelines. A member of our team will be in touch shortly with a set of requests. As we can't proceed until these requirements are met, your swift response will help prevent delays to publication.

*Copyediting*

Upon acceptance of your article, your final files will be copyedited and typeset into the final PDF. While you will have an opportunity to review these files as proofs, PLOS will only permit corrections to spelling or significant scientific errors. Therefore, please take this final revision time to assess and make any remaining major changes to your manuscript.

NOTE: If Supporting Information files are included with your article, note that these are not copyedited and will be published as they are submitted. Please ensure that these files are legible and of high quality (at least 300 dpi) in an easily accessible file format. For this reason, please be aware that any references listed in an SI file will not be indexed. For more information, see our Supporting Information guidelines:

https://journals.plos.org/plosbiology/s/supporting-information

*Published Peer Review History*

Please note that you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details:

https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/

*Early Version*

Please note that an uncorrected proof of your manuscript will be published online ahead of the final version, unless you opted out when submitting your manuscript. If, for any reason, you do not want an earlier version of your manuscript published online, uncheck the box. Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us as soon as possible if you or your institution is planning to press release the article.

*Protocols deposition*

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosbiology/s/submission-guidelines#loc-materials-and-methods

*Submitting Your Revision*

To submit your revision, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' to find your submission record. Your revised submission must include a cover letter, a Response to Reviewers file that provides a detailed response to the reviewers' comments (if applicable), and a track-changes file indicating any changes that you have made to the manuscript.

Please do not hesitate to contact me should you have any questions.

Sincerely,

Roli Roberts

Roland G Roberts, PhD,

Senior Editor

PLOS Biology

------------------------------------------------------------------------

DATA POLICY:

You may be aware of the PLOS Data Policy, which requires that all data be made available without restriction: http://journals.plos.org/plosbiology/s/data-availability. For more information, please also see this editorial: http://dx.doi.org/10.1371/journal.pbio.1001797

Many thanks for depositing your data in Figshare; this is much appreciated. However, my understanding is that this largely comprises the "raw" geographical raster data; we also ask that all the numerical values summarized graphically in the figures and results of your paper be made available in one of the following forms:

1) Supplementary files (e.g., excel). Please ensure that all data files are uploaded as 'Supporting Information' and are invariably referred to (in the manuscript, figure legends, and the Description field when uploading your files) using the following format verbatim: S1 Data, S2 Data, etc. Multiple panels of a single or even several figures can be included as multiple sheets in one excel file that is saved using exactly the following convention: S1_Data.xlsx (using an underscore).

2) Deposition in a publicly available repository. Please also provide the accession code or a reviewer link so that we may view your data before publication.

Regardless of the method selected, please ensure that you provide the individual numerical values that underlie the summary data displayed in the following figure panels as they are essential for readers to assess your analysis and to reproduce it: Figs 1A, 3, 5, 7, 8, 9, S4, S5, S6, S8. NOTE: the numerical data provided should include all replicates AND the way in which the plotted mean and errors were derived (it should not present only the mean/average values).

Please also ensure that figure legends in your manuscript include information on where the underlying data can be found (i.e. the Figshare deposition and the bioRxiv DOI), and ensure your supplemental data file/s has a legend.

Please ensure that your Data Statement in the submission system accurately describes where your data can be found.

------------------------------------------------------------------------

REVIEWERS' COMMENTS:

Reviewer #2:

The authors have addressed all my comments and answered my questions adequately. I am glad they included An.funestus results even if the number of data point were much lower than An.gambiae.

Reviewer #3:

[Identifies himself as Tovi Lehmann]

Overall the revisions are thorough and satisfactorily address all the key points. The sophisticated methodology is explained in less-technical terms and the new graphs help provide a more complete picture. I enjoyed reading the paper and feel it will resonate with much interest. So my recommendation is "Accept".

I understand if the authors prefer to expand on that elsewhere, but I would like to hear their comments (2-3 sentences) on the relationships between insecticide resistance and corresponding malaria decline, or lack of decline over that 13-15 years time frame. Their analysis may provide a very valuable if not a decisive answer to this important yet incompletely addressed question.

Minor points (up to authors to decide if they would like to address):

- L270-2. 'Attenuations and declines in resistance may reflect fitness costs of resistance, or they may also arise due to shifts in the composition of the sibling species that make up the An. gambiae complex (see the Discussion).'

I'd consider adding a the contribution of migration from areas with low resistance.

L 470 'in populations of Gambiae complex mosquito species' , consider revising this fragment. At least, change to ..An. gambiae complex...

Decision Letter 3

Roland G Roberts

11 May 2020

Dear Dr Hancock,

On behalf of my colleagues and the Academic Editor, Andrew Fraser Read, I am pleased to inform you that we will be delighted to publish your Research Article in PLOS Biology.

The files will now enter our production system. You will receive a copyedited version of the manuscript, along with your figures for a final review. You will be given two business days to review and approve the copyedit. Then, within a week, you will receive a PDF proof of your typeset article. You will have two days to review the PDF and make any final corrections. If there is a chance that you'll be unavailable during the copy editing/proof review period, please provide us with contact details of one of the other authors whom you nominate to handle these stages on your behalf. This will ensure that any requested corrections reach the production department in time for publication.

Early Version

The version of your manuscript submitted at the copyedit stage will be posted online ahead of the final proof version, unless you have already opted out of the process. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

PRESS

We frequently collaborate with press offices. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximise its impact. If the press office is planning to promote your findings, we would be grateful if they could coordinate with biologypress@plos.org. If you have not yet opted out of the early version process, we ask that you notify us immediately of any press plans so that we may do so on your behalf.

We also ask that you take this opportunity to read our Embargo Policy regarding the discussion, promotion and media coverage of work that is yet to be published by PLOS. As your manuscript is not yet published, it is bound by the conditions of our Embargo Policy. Please be aware that this policy is in place both to ensure that any press coverage of your article is fully substantiated and to provide a direct link between such coverage and the published work. For full details of our Embargo Policy, please visit http://www.plos.org/about/media-inquiries/embargo-policy/.

Thank you again for submitting your manuscript to PLOS Biology and for your support of Open Access publishing. Please do not hesitate to contact me if I can provide any assistance during the production process.

Kind regards,

Alice Musson

Publishing Editor,

PLOS Biology

on behalf of

Roland Roberts,

Senior Editor

PLOS Biology

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Predicted mean proportional mortality to permethrin across the west and east regions.

    (A) 2005, (B) 2010, (C) 2015, and (D) 2017. See 10.6084/m9.figshare.9912623.

    (TIF)

    S2 Fig. Predicted mean proportional mortality to λ-cyhalothrin across the west and east regions.

    (A) 2005, (B) 2010, (C) 2015, and (D) 2017. See 10.6084/m9.figshare.9912623.

    (TIF)

    S3 Fig. Predicted mean proportional mortality to α-cypermethrin across the west and east regions.

    (A) 2005, (B) 2010, (C) 2015, and (D) 2017. See 10.6084/m9.figshare.9912623.

    (TIF)

    S4 Fig. Histograms of the approximate cross-validated PIT values comparing observations and cumulative predictive densities across all susceptibility test observations for pyrethroids.

    Numerical values are provided in S5 Data (10.6084/m9.figshare.9912623).

    (TIF)

    S5 Fig. Predictions of mean proportional mortality from 10-fold out-of-sample validation performed on the Gaussian process regression meta-model.

    The vertical axis shows the corresponding value observed from the bioassay. Values for all data points for all pyrethroid types (deltamethrin, permethrin, λ-cyhalothrin, and α-cypermethrin) for the west region (red markers) and the east region (blue markers) are shown. The RMSE across all data values is 0.179 (RMSE = 0.191 for the data within the west region and RMSE = 0.166 for the data within the east region). Numerical values are provided in S6 Data (10.6084/m9.figshare.9912623). RMSE, root mean square error.

    (TIF)

    S6 Fig. The proportion of withheld data points that fell within the predicted CIs, based on 10-fold out-of-sample validation, when accounting for the estimated measurement error (see S1 Text).

    Numerical values are provided in S7 Data (10.6084/m9.figshare.9912623). CI, credible interval.

    (TIF)

    S7 Fig. The prediction error (95% CI) associated with predicted mean mortality to DDT.

    See 10.6084/m9.figshare.9912623. CI, credible interval.

    (TIF)

    S8 Fig

    The predicted mean proportional mortality to deltamethrin over time for the point locations in the east (A) and west (B) regions that experienced the greatest overall increase in resistance from 2005 to 2017 (Fig 2; locations A, B, C, D, E, F, G, H, and I). Dashed lines show the 95% CIs of the predicted mean mortality. Numerical values are provided in S8 Data (10.6084/m9.figshare.9912623). CI, credible interval.

    (TIF)

    S9 Fig

    The maximum interannual change in the predicted mean mortality to deltamethrin over the time period 2005–2017 at each location within the west and east regions: (A) the maximum interannual decrease, (B) the maximum interannual increase. Interannual increases and decreases in predicted mortality are calculated as the difference in predictions between 2 consecutive years, for all years 2005 to 2017. See 10.6084/m9.figshare.9912623.

    (TIF)

    S10 Fig. The Pearson correlation coefficient between each of the 20 variables with the highest weighted variable importance value for the models fitted to the West Africa data set.

    (TIF)

    S11 Fig. The Pearson correlation coefficient between each of the 20 variables with the highest weighted variable importance value for the models fitted to the East Africa data set.

    (TIF)

    S1 Table. Fitted parameters of the Bayesian Gaussian process regression models.

    Numbers in brackets are the 95% CIs. CI, credible interval.

    (DOCX)

    S2 Table. The RMSE given by 10-fold out-of-sample validation performed on the Gaussian process regression meta-model fitted to the bioassay records for the four pyrethroid insecticides (deltamethrin, permethrin, λ-cyhalothrin, and α-cypermethrin) and each of the 3 machine-learning model constituents.

    The unit of the transformed RMSE values corresponds to the (empirical logit and IHS-transformed) observations to which the models were fitted. IHS, inverse hyperbolic sine; RMSE, root mean square error.

    (DOCX)

    S3 Table. The RMSE given by 10-fold out-of-sample validation performed on the Gaussian process regression meta-model fitted to the bioassay records for DDT and each of the 3 machine-learning model constituents.

    The unit of the transformed RMSE values corresponds to the (empirical logit and IHS-transformed) observations to which the models were fitted. IHS, inverse hyperbolic sine; RMSE, root mean square error.

    (DOCX)

    S4 Table. The fitted weights for each constituent model included in the Gaussian process regression meta-model.

    (DOCX)

    S5 Table. Variable importance values for predictor variables given by each machine-learning model included in the model ensemble for the west region.

    The 30 variables that were most highly ranked by XGB are shown. Definitions of each predictor variable are given in S9 Table. Variable name suffixes (-1), (-2) and (-3) denote time lags of 1, 2, and 3 years, respectively. One, two, and three asterisks denote the first, second, and third principal component, respectively, for variables available on a monthly time step. XGB, extreme gradient boosting model.

    (DOCX)

    S6 Table. Variable importance values for predictor variables given by each machine-learning model included in the model ensemble for the east region.

    The 30 variables that were most highly ranked by XGB are shown. Definitions of each predictor variable are given in S9 Table. Variable name suffixes (-1), (-2) and (-3) denote time lags of 1, 2, and 3 years, respectively. One, two and three asterisks denote the first, second, and third principal component, respectively, for variables available on a monthly time step. XGB, extreme gradient boosting model.

    (DOCX)

    S7 Table. Number of bioassay records for each insecticide type and number of Vgsc allele frequency observations.

    (DOCX)

    S8 Table. Number of bioassay records for each year for each insecticide class.

    (DOCX)

    S9 Table. Descriptions of each potential explanatory variable used in the ensemble model.

    If the data layer was obtained from an online repository, the URL and date accessed are given. If the data layer has a citation, then this is given.

    (DOCX)

    S10 Table. l0 models and parameters.

    (DOCX)

    S1 Text. Supplementary information about the modelling methodology.

    (PDF)

    Attachment

    Submitted filename: Responses to reviewers.pdf

    Attachment

    Submitted filename: Response to reviewers II.pdf

    Data Availability Statement

    The predictive maps of the mean prevalence of resistance are available to download from Figshare (10.6084/m9.figshare.9912623) and will be available to visualise on the Malaria Atlas Project website (https://map.ox.ac.uk/explorer/#). The susceptibility test data is available to download (https://doi.org/10.1101/582510 [8]). Sets of susceptibility test data and predictor variable data in the form used by the statistical modelling analyses are available from GitHub. Numerical data corresponding to the manuscript figures are available to download from Figshare (10.6084/m9.figshare.9912623). R code for implementing the extreme gradient boosting, random forest, and boosted generalized additive models and the R-INLA geostatistical models is available on GitHub at 10.5281/zenodo.3751786 (Hancock, 2020).


    Articles from PLoS Biology are provided here courtesy of PLOS

    RESOURCES