Skip to main content
eLife logoLink to eLife
. 2014 Apr 24;3:e02130. doi: 10.7554/eLife.02130

A micro-epidemiological analysis of febrile malaria in Coastal Kenya showing hotspots within hotspots

Philip Bejon 1,2,*, Thomas N Williams 1,3, Christopher Nyundo 1, Simon I Hay 4, David Benz 4, Peter W Gething 4, Mark Otiende 1, Judy Peshu 1, Mahfudh Bashraheil 1, Bryan Greenhouse 5, Teun Bousema 6,7, Evasius Bauni 1, Kevin Marsh 1,2, David L Smith 8, Steffen Borrmann 1,9,10
Editor: Mercedes Pascual11
PMCID: PMC3999589  PMID: 24843017

Abstract

Malaria transmission is spatially heterogeneous. This reduces the efficacy of control strategies, but focusing control strategies on clusters or ‘hotspots’ of transmission may be highly effective. Among 1500 homesteads in coastal Kenya we calculated (a) the fraction of febrile children with positive malaria smears per homestead, and (b) the mean age of children with malaria per homestead. These two measures were inversely correlated, indicating that children in homesteads at higher transmission acquire immunity more rapidly. This inverse correlation increased gradually with increasing spatial scale of analysis, and hotspots of febrile malaria were identified at every scale. We found hotspots within hotspots, down to the level of an individual homestead. Febrile malaria hotspots were temporally unstable, but 4 km radius hotspots could be targeted for 1 month following 1 month periods of surveillance.

DOI: http://dx.doi.org/10.7554/eLife.02130.001

Research organism: human

eLife digest

Malaria remains a formidable threat to public health in tropical regions. The parasite that causes the disease is transmitted to humans by bites from infected mosquitoes, and the complicated lifecycle of the parasite makes developing vaccines difficult. However, preventive strategies are effective at reducing the spread of malaria. The two most widely used and effective strategies are the use of pesticide-treated bed nets to create a barrier between sleeping families and biting mosquitoes, and indoor residual spraying to reduce the numbers of mosquitoes biting sleeping families in homesteads. Other potential preventive strategies include killing mosquito larvae in breeding sites and mass anti-malarial drug treatment for infected humans.

Targeting preventive efforts to malaria hotspots—the areas where the risk of malaria transmission is greatest—may help to eliminate malaria more efficiently. Unfortunately, identifying hotspots is complicated as there are many different factors that affect how malaria spreads. These factors range from ecological conditions such as rainfall and soil type, to human effects like population density and migration.

Bejon et al. have examined the patterns of malaria transmission in Kenya over 9 years. Over this period, 54% of children who went to health clinics with a fever tested positive for the parasite that causes malaria. Infected children from areas with the highest rate of malaria infection were, on average, younger than those from less infected regions. This makes sense as in highly affected areas children have a greater chance of encountering the parasite at an early age. They are therefore more likely to get malaria when younger and, as exposure to the parasite can provide some immunity to a child, they are also less likely to get infected again when older.

In addition, mapping the spread of malaria reveals hotspots at different geographical scales. Bejon et al. could see hotspots within hotspots, and in some cases could go as far as identifying the individual homesteads most at risk of malaria. Public health workers could potentially use these analyses to identify areas that are likely to be hotspots and then target preventive measures there for the next month. However, the constantly changing locations of the hotspots means workers would have to reanalyse the data and retarget their interventions at the end of each month.

DOI: http://dx.doi.org/10.7554/eLife.02130.002

Introduction

The transmission of infectious disease often shows substantial heterogeneity (Woolhouse et al., 1997). Malaria transmission is determined by mosquito ecology and behavior, which is in turn determined by rainfall, hydrology, soils, human behavior and population distributions, and a range of other social, biotic and abiotic factors. Heterogeneity of malaria transmission is apparent at global scale (Gething et al., 2011), regional scale (Kleinschmidt et al., 2001a; Noor et al., 2009), and at fine scale in, for instance, Mali (Gaudart et al., 2006), Ghana (Kreuels et al., 2008), Ethiopia (Yeshiwondim et al., 2009) Kenya (Brooker et al., 2004; Ernst et al., 2006; Bejon et al., 2010), and Tanzania (Bousema et al., 2010). This spatial heterogeneity makes transmission relatively resilient to indiscriminate control efforts, but also provides an opportunity to engage in targeted malaria control on clusters of transmission (or ‘hotspots’), a strategy that is predicted to be highly effective (Dye and Hasibeder 1986; Woolhouse et al., 1997).

We have previously identified hotspots of malaria using active surveillance (Bejon et al., 2010). Others have identified hotspots using passive surveillance in health facilities linked to demographic surveillance systems (Ernst et al., 2006). Passive surveillance is more readily scaled up, but may be biased by variations in access to health care facilities and socially-determined health-seeking behavior (Sumba et al., 2008; Franckel and Lalou 2009). The incidence of febrile malaria presenting to health care is thus biased by access to care. This bias may be countered by using the malaria positive fraction (MPF) among children with fever (also termed ‘slide positivity rate’ in some publications [Jensen et al., 2009]). The MPF includes all febrile children presenting to the dispensary as the denominator, hence controlling for access to health care, in contrast to incidence for which all children in the community are included in the denominator. The MPF is less likely to show systematic spatial bias with distance from the health facility since parental accounts of illness have not been found to discriminate malaria from non-malarial fever (Luxemburger et al., 1998; Mwangi et al., 2005), and diagnostic testing is not available outside the dispensary.

We present data from demographic surveillance linked to passive case detection in Pingilikani dispensary in Kilifi District, coastal Kenya. Data are collected from 1500 homesteads within an 8 km radius followed for 9 years. We analyse the spatial heterogeneity of malaria cases in order to determine the temporal and spatial scales of case clustering so as to inform targeting in malaria control programmes. We also excluded visits with specific symptoms such as skin infections or cutaneous abscesses, otitis media, and gastroenteritis (>4 episodes diarrhoea per day) that might have been the primary motivation for seeking health care rather than fever per se.

Results

Among ∼20,000 remaining febrile presentations from ∼1500 different residences, 54% were positive for Plasmodium falciparum on blood smear examination. Using homestead as our unit of analysis, we found that the incidence of dispensary attendance declined with distance from the dispensary (on average −0.040 (95%CI 0.036–0.044) and −0.041 (95%CI 0.037–0.046) episodes per child year for each km for malaria smear positive and negative attendees, respectively). MPF was not found to vary significantly by distance of residence from the dispensary (from MPF = 0.50, 95%CI 0.47 to 0.54 at <2 km distance to MPF = 0.52, 95%CI 0.47 to 0.57 at 6–7 km, p=0.7).

The spatio-temporal distribution of MPF by homestead is shown in Video 1 (slow speed) and Video 2 (fast speed). The visual impression from these clips suggests marked spatial variation, with some geographical areas showing persistently high MPFs, and other areas showing more marked temporal variation. Temporally stable spatial heterogeneity would be expected to lead to spatial heterogeneity in the acquisition of immunity, which may be evidenced by variation in the age profiles of children with febrile malaria. We therefore tested this hypothesis as below.

Video 1. Each plotted point represents an individual homestead, where the colour shading indicates the malaria positive fraction (MPF), with red shading for high MPF and blue shading for low MPF.

Download video file (669.3KB, wmv)
DOI: 10.7554/eLife.02130.003

Points change colour each year.

DOI: http://dx.doi.org/10.7554/eLife.02130.003

Video 2. Each plotted point represents an individual homestead, where the colour shading indicates the malaria positive fraction (MPF), with red shading for high MPF and blue shading for low MPF.

Download video file (489.6KB, wmv)
DOI: 10.7554/eLife.02130.004

Points change color each year. The frames are identical to those in Video 1, but move more rapidly.

DOI: http://dx.doi.org/10.7554/eLife.02130.004

Spatial heterogeneity in malaria risk and acquisition of immunity

MPF was inversely correlated with the average age of children with malaria, Spearman's rank correlation (rs) = −0.16, p<0.0001 (Figure 1A–C). This suggests that greater exposure to malaria (i.e., high MPF) leads to more rapid acquisition of immunity as children grow up, hence predominantly younger children visiting the dispensary with febrile malaria. There was no evidence that this relationship was confounded by spatial clustering of age: the average age of children with non-malarial fever did not show spatial clustering (Moran's I = 0.01, p=0.5 within 1 km and Moran's I = 0.02, p=0.5 within 5 km) and was not associated with MPF (rs = −0.02, p=0.4). We examined the effect of spatial scale at which this correlation occurred by imposing grids of increasing cell size on the study area, calculating rs within each cell of the grid, and then estimating the mean rs at each scale of grid (Figure 1D, blue lines). The mean rs trended gradually away from 0 as the grid divisions became larger in scale. This pattern suggests gradual differentiation in transmission characteristics as the distance between homesteads included within a cell of the grid increases. We then examined the patterns seen on applying this analysis to simulated data. In order to exclude that this trend was a result of cells at fine-scale containing fewer homesteads, we ran permutations of the data using after randomly re-assigning spatial coordinates to the homesteads. These permutations show that a consistent correlation at rs = −0.16 throughout the range of grid sizes, albeit with greater uncertainty with smaller cell size (Figure 1D, red lines). Hence, the trend of a gradually increasing inverse correlation as the grid size increases does not appear to be explained simply by having fewer homesteads in each cell at fine scale. In order to determine the pattern that might be seen with specific spatial scales of clustering, we conducted further simulations by imposed patterns with specific scales on the spatial coordinates of the homesteads, in varying proportions with random noise using a gamma distribution. These simulations show that a specific scale of clustering produces ‘spikes’ in rs as the cell size varies, with the position of the spike coinciding with scale of the clustering (Figure 1—figure supplement 1). Reducing the Signal:Noise ratio eventually obscured the ‘spikes’ due to a characteristic pattern, but only at the point where the overall correlation was no longer discernible (Figure 1—figure supplement 2). Adding a gradient to the simulated characteristic scale attenuated but did not obscure the ‘spikes’ (Figure 1—figure supplement 3).

Figure 1. Geographical distribution of malaria positive fraction and average age of febrile malaria.

Each plotted point represents an individual homestead, where the colour shading indicates the malaria positive fraction (MPF) in panel A, or the average age of children who test positive for malaria in panel B. Panel C shows the scatter plot for MPF vs average age (Spearman's rank correlation coefficient (rs) = −0.16, p<0.0001). Panel D shows rs (y axis) plotted against scale of analysis (x axis), where a grid with varying cell size is imposed on the study area, rs is calculated within each cell and then the mean rs presented, with 95% confidence intervals produced by boot-strap (blue solid and dashed lines, respectively), and the results of analysis of spatially-random permutations of the data with equivalent cell size are shown for comparison (red solid and dashed lines, respectively). The analysis shown in panel D was compared on simulations with varying simulated characteristic scales, Signal:Noise ratios and with added gradients (Figure 1—figure supplements 1–3, respectively).

DOI: http://dx.doi.org/10.7554/eLife.02130.005

Figure 1.

Figure 1—figure supplement 1. Simulated data with varying imposed scales of clustering.

Figure 1—figure supplement 1.

Simulated data using imposed spatial clustering at specific scales are analysed to determine rs (y axis) plotted against scale of analysis (x axis), where a grid with varying cell size is imposed on the study area, rs is calculated within each cell and then the mean rs presented, with 95% confidence intervals produced by boot-strap (blue solid and dashed lines, respectively). The six panels show the appearances of different imposed scales as shown in the sub-titles.
Figure 1—figure supplement 2. Simulated data with varying signal to noise ratios.

Figure 1—figure supplement 2.

Simulated data using imposed spatial clustering at specific scales are analysed to determine rs (y axis) plotted against scale of analysis (x axis), where a grid with varying cell size is imposed on the study area, rs is calculated within each cell and then the mean rs presented, with 95% confidence intervals produced by boot-strap (blue solid and dashed lines, respectively). The six panels show the appearances using different Signal:Noise ratios.
Figure 1—figure supplement 3. Simulated data with varying gradients around imposed scales of clustering.

Figure 1—figure supplement 3.

Simulated data using imposed spatial clustering at specific scales are analysed to determine rs (y axis) plotted against scale of analysis (x axis), where a grid with varying cell size is imposed on the study area, rs is calculated within each cell and then the mean rs presented, with 95% confidence intervals produced by boot-strap (blue solid and dashed lines, respectively). The six panels show the appearances using gradients of varying spatial scales around the simulated clustering.

Hotspots within hotspots

Using the Bernoulli model in SaTScan (Kulldorff, 1997), we identified a hotspot with a radius of 5.8 km at p<0.00001 (Figure 2A) using the full data set (for which n = 20,702). However, on re-analysis of the children within this hotspot (in which n = 5300), we identified a further hotspot (with a radius of 0.76 km) within the 5.8 km hotspot (p<0.00001, Figure 2B). Then on further re-analysis of the homesteads within that 0.76 km hotspot (within which n = 1406), we identified a third significant hotspot (p=0.016) which comprised a single homestead, in which there were 36 episodes of malaria compared with 3 malaria negative fevers (Figure 2D). When we selected a random 5-km square area outside the original 5.8 km radius hotspot, we identified a hotspot within this area a fourth hotspot with a 1.32 km radius (p<0.00001, Figure 2C).

Figure 2. Hotspots within hotspots.

Each plotted point represents an individual homestead, where the colour shading indicates the malaria positive fraction (MPF). Hotspots are identified using SATScan, using the whole study area (panel A), then repeated within the hotspot (panel B), within the hotspot of panel B (panel D), and then within a randomly chosen area outside the hotspot (panel C). The semi-variogram and log–log semi-variogram plot are shown in Figure 2—figure supplements 1 and 2, respectively.

DOI: http://dx.doi.org/10.7554/eLife.02130.009

Figure 2.

Figure 2—figure supplement 1. Semi-variogram.

Figure 2—figure supplement 1.

The semi-variogram is shown for MPF. A lowess smoothed line is superimposed on the data points.
Figure 2—figure supplement 2. Log-log plot of semi-variogram.

Figure 2—figure supplement 2.

The log–log plot of the semi-variogram is shown for MPF. A lowess smoothed line is superimposed on the data points.

To further explore the scale of spatial clustering, we plotted the semivariogram (Figure 2—figure supplement 1) and the log–log transformed semivariogram (Figure 2—figure supplement 2). These plots suggested linear fits for the semivariogram, suggesting that spatial clustering occurred over a range of spatial scales.

Temporal trends of spatial heterogeneity

We also examined temporal trends for individual homesteads (Figure 3). There was an inverse correlation between the mean MPF and the variance in MPF over the 10-year study period (rs = −0.61, p<0.0001, Figure 3A). The temporal trends for two subsets of homestead can be seen in Figure 3B (stable high MPF) and Figure 3C (unstable low MPF), suggesting that homesteads can be characterized as stable high transmission homesteads or unstable low transmission homesteads. Infant parasite rates have been proposed as a measure of transmission intensity that minimizes the offsetting of acquired immunity in macro-epidemiological studies (Snow et al., 1996). We therefore hypothesized that the malaria positive fractions in children <1 year of age (hereafter ‘MPF<1yr’) would measure transmission intensity without the offsetting of acquired immunity, and that unstable transmission would result in higher risk of malaria in older children. To test this hypothesis, we calculated the mean MPF<1yr and the variance in MPF<1yr for each homestead over the 9 years of follow up and tested the relationships between these metrics and risk of malaria in older children in multivariable linear regression models.

Figure 3. Temporal variations in malaria positive fraction.

Figure 3.

(Panel A) shows the scatter plot of individual homesteads by mean malaria positive fraction (MPF) on the x axis vs variance in MPF on the y axis (rs = −0.61, p<0.0001). A labelled blue circle indicates subset q (homesteads with high variance but low mean MPF) and subset p (homesteads with low variance and high mean MPF). The temporal trends for these two subsets are shown on panels (B and C), respectively. The median trend for the study area is shown in red. (Panel D) shows the regression coefficients (y axis) for the malaria positive fractions (MPF) in older children when regressed on; (i) the mean MPF in children <1 year of age (MPF<1y) and (ii) MPF in older children when regressed on the variance in MPF<1y over the 9 years of the study. Separate multivariable regression models (i.e., with mean MPF<1y and variance in MPF<1y as explanatory variables) are fit for each age group as shown on the x axis (excluding children <1 year of age, whose data are used to calculate MPF<1y).

DOI: http://dx.doi.org/10.7554/eLife.02130.012

In multivariable linear regression models, MPF<1yr was strongly correlated with MPFs in children in the 1- to 2-year-old and 2- to 3-year-old age group, but progressively less strongly correlated with MPF in older children (Figure 3Di). The regression coefficient was ∼0.4 for 1–2 year olds, meaning that each unit increase in MPF<1yr is associated with a 0.4 increase in the MPF for 1- to 2-year-old children. On the other hand, the variance in MPF<1yr was not correlated with MPFs in 1- to 2- or 2- to 3-year-old children, but was progressively more strongly correlated with MPF in older children (Figure 3Dii). Hence there were high stable transmission homesteads, with predominantly younger children getting febrile malaria, and low unstable transmission homesteads, with increasing risk to older children. This pattern of high stable vs low unstable transmission also occurs between regions or countries, and demonstrates a similarity between the micro- and macro-epidemiology of malaria (Hay et al., 2008).

Theoretical accuracy of targeted control undertaken at varying temporal and spatial scales

We then used our data set to simulate the accuracy of targeting cases that a malaria control programme might achieve on conducting surveillance over a defined period of time followed by targeted control. We assumed that malaria control programmes would need to define a priori the period of time to use for surveillance, and also to select a spatial scale at which to define hotspots. For varying time periods and spatial scales, we determined the % of excess malaria cases within the targeted hotspots compared with the surrounding area in the period of time immediately following the simulated surveillance.

One week periods of surveillance (top left panel of Figure 4) did not identify hotspots that are still present the following week at fine spatial scales (i.e., the plotted line indicates that the accuracy of targeting is 0% at scales of less than 1 km). On the other hand, at larger spatial scales we found that 1 week periods of surveillance were more accurate, resulting in the targeting of areas with a 60% excess of new malaria cases compared with the surrounding area at a scale of an 8 km diameter. A similar pattern was seen for monthly periods of surveillance. Longer surveillance periods (e.g., 6 months) resulted in targeting areas with an excess of 20% malaria cases compared with the surrounding area over the range of spatial scales examined.

Figure 4. Theoretical accuracy of targeted control undertaken at varying temporal and spatial scales.

Figure 4.

The accuracy of varying strategies of hotspot identification is shown. Each panel is labelled with the time period of surveillance data used. The x axis shows the diameter of hotspot defined. In each case hotspots were selected to account for 20% of the homesteads in the area. The y axis shows the increase that would have been present assuming that they were targeted in the time period following their identification.

DOI: http://dx.doi.org/10.7554/eLife.02130.013

ITN use and spatial variation in risk

Mass distributions of Insecticide Treated Nets (ITNs) in the area began in 2006. ITN use was surveyed in 2009 and 2010. We found that children using ITNs had a reduced risk of malaria by logistic regression (i.e., OR = 0.69, 95%CI 0.67 to 0.8, p<0.001), in keeping with previous literature on the personal protection provided by ITN use (Lim et al., 2011). On the other hand, we did not identify significant evidence that ITN use was clustered spatially (Moran's I = 0.02, p=0.5). Furthermore, adding ITN use as a covariate in SaTScan analysis to locate hotspots had little effect on results; the addition of ITN use as a covariate changed the location of the hotspot by 120 m, and changed the predicted radius of the hotspot from 5.4 km to 5.2 km. On re-analysis of the homesteads within the 5.4 km hotspot, a further 0.87 km hotspot was identified the position and radius of which were not altered by the inclusion of ITN use as a covariate. Finally, within this 0.87 km hotspot the same 7 homesteads were identified as a hotspot irrespective of the inclusion of ITN use as a covariate. We did not identify significant evidence that ITN use correlated mean MPF<1yr (rs = −0.04, p=0.04) or with the variance in MPF<1yr (rs = −0.01, p=0.7). Hence, ITNs provided personal protection from malaria, but we were unable to show that they explained the spatial micro-epidemiological patterns.

Discussion

We found that malaria cases were spatially heterogeneous in an 8-km radius area of coastal Kenya. The strongly significant inverse correlation between the malaria positive fraction (MPF) and average age of children presenting with malaria suggests variable acquisition of immunity between homesteads. Homesteads at high transmission intensity have a high MPF and a young average age of malaria (with older children becoming immune and therefore not presenting to the dispensary) whereas homesteads at low transmission intensity have a low MPF but an older average age of malaria since older children are not becoming immune as rapidly. In theory, this inverse correlation might have arisen because of heterogeneity at various spatial scales. For instance, there might have been a block of homesteads all at high transmission in one half of the study area (thus with high MPF and low average age) and a second block of homesteads at low transmission in the other half (with low MPF and high average age). On the other hand, the inverse correlation might have arisen because of a random distribution of ‘high’ and ‘low’ transmission intensity homesteads throughout the study area.

To determine at which spatial scale transmission was heterogeneous, we conducted an analysis where correlation coefficient was recalculated within each cell of a grid superimposed on the study area. The mean correlation coefficient of all cells was then presented as the cell size of the grid used was increased (Figure 1D). This analysis was done to identify the most influential geographical scale at which the inverse correlation was observed. In simulated data, we noted ‘spikes’ where the inverse correlation was abruptly lost when the size of cells in the grid coincides with the size of the geographical ‘blocks’ of homesteads that drove the inverse correlation, as seen in Figure 1—figure supplement 1. Similar spikes were seen after adding simulated noise and gradients in space over which the correlation varied (Figure 1—figure supplements 1, 2 and 3). Real-world data would contain more complex sources of variation than we have simulated, and hence may not produce distinct spikes. Nevertheless, the analysis of these simulations suggests that discontinuities in the correlation between MPF and average age of malaria over cell size might be expected when clustering is at a specific spatial scale. In fact there was no such discontinuity in the function shown in Figure 1D, indicating that the inverse correlation was present at every geographical scale examined within our study. It is likely that this pattern would extend at greater geographical scales, since a similar inverse correlation between the age distributions of malaria cases and transmission intensity can be seen on comparing countries and regions (Okiro et al., 2009).

The pattern of spatial heterogeneity is relevant to malaria control, since targeted disease control is predicted to be highly effective (Woolhouse et al., 1997). Spatial targeting is particularly appropriate for malaria ‘hotspots’ (Coleman et al., 2009; Moonen et al., 2010; Bousema et al., 2012; Sturrock et al., 2013) and many malaria control programmes are already engaged in spatially-targeted intervention (Zhou et al., 2010; Loha et al., 2012). Our data showing clustering at varying spatial scales suggest that malaria control programs can expect to identify hotspots at many different geographical scales. We demonstrate that hotspots occur within hotspots, down to the level of a single homestead, and also that hotspots can be identified on ‘zooming in’ on random areas outside the main hotspot (Figure 2C). These hotspots were based on analysis of a large dataset with adequate power, and were strongly significant based on the multiple permutations run in SaTScan, suggesting that type I statistical error is an unlikely explanation for our findings. The complexity of presenting ‘hotspots within hotspots’ to a malaria control programme is further compounded by the temporal instability of the spatial pattern (Figure 3).

We therefore simulated the accuracy with which hotspots could be targeted using varying spatial scales and varying time periods of surveillance. We found that using data aggregated over 1 month of surveillance to define 4 to 8 km diameter hotspots would provide greatest accuracy, but this information is only relevant for 1 month before temporal instability necessitates further surveillance. One might therefore consider a continuous programme of parallel surveillance and targeting, where the surveillance data are examined at the end of each month to determine the location to be targeted for the following month. Continuous surveillance would allow adaptive targeting of hotspots for the following month. Such a strategy might be employed all year round, or for a limited period of the year depending on local seasonality. (Cairns et al., 2012) Targeting at this spatial scale has the added practical advantage that it could be done with village-level location data and would not require fine-scale geo-positional data.

There are some caveats to this recommendation. Our observations are from a single site. Other sites should examine their local data to determine whether a similar targeting strategy is appropriate. Furthermore, some hotspots did show temporal stability. For instance, we identified a 6 km diameter hotspot south east of the dispensary that maintained a 30–60% increase in MPF compared with the surrounding area throughout the 9-year surveillance.

Children with positive microscopy slides for malaria presenting at the dispensary may have genuine febrile malaria, or alternatively may have chronic asymptomatic parasitaemia with co-incident non-malarial fever. Previous studies estimating malaria attributable fractions in the locality suggest 61% of the children in our analysis would have malaria as the proximate cause of their illness, with the other 39% having chronic asymptomatic parasitaemia with co-incident fever from another cause (Olotu et al., 2011). We have previously demonstrated that spatial heterogeneity is more temporally stable when analysed for asymptomatic parasitaemia rather than febrile malaria (Bejon et al., 2010). Targeting hotspots of asymptomatic parasitaemia would require community surveys rather than dispensary monitoring, which may need to be done less frequently than monitoring of febrile malaria episodes.

Furthermore MPF is not a comprehensive indicator of transmission intensity. Homesteads with consistently low average ages of febrile malaria are likely to be stable high transmission homesteads (such as those in subset p of Figure 3A) which amplify transmission in the areas surrounding them. Targeting such high transmission homesteads to interrupt transmission may be highly effective (Woolhouse et al., 1997). The stronger inverse correlation between MPF and average age of febrile malaria as spatial scale increases (Figure 1) suggests that the spatial heterogeneity of transmission is progressively more stable at more coarse spatial scales.

Malaria transmission is determined by mosquito ecology and behavior. Mosquito ecology may be determined by obvious geographical features such as altitude (Reyburn et al., 2005), cultivation practices (Lindsay et al., 1991), streams and dams (Ghebreyesus et al., 1999), wind direction (Midega et al., 2012) and mosquito searching behaviour for hosts (Smith et al., 2004). Ecological models based on such features have been developed using frequentist techniques (Omumbo et al., 2005), Bayesian approaches (Craig et al., 2007), and fuzzy logic (Snow et al., 1998). However, the same ecological factor may act inconsistently in different geographical areas (Kleinschmidt et al., 2001b; Gemperli et al., 2006; Noor et al., 2008), and the effect of ecological factors is modified by fine-scale vector and host movement (Perkins et al., 2013). Our data suggests that the environmental factors determining malaria transmission operate at a range of spatial scales. We might speculate that mosquito breeding site density could be equally influenced by proximity to a large geographical feature such as a river, or to a micro-geographical feature such as a cow hoof-print (Sattler et al., 2005). Hence ecological models of malaria transmission will need to include data at a range of spatial scales in order to accurately predict malaria risk.

Materials and methods

Approval for human participation in these cohorts was given by Kenya Medical Research Institute Ethics Research Committee, and research was conducted according to the principles of the declaration of Helsinki.

Study population

Pingilikani Dispensary is 40 km to the North of Mombasa, in Kilifi Country, Coast Province, Kenya. The population relies mainly on subsistence farming and experiences all year round malaria transmission, with ‘long’ and ‘short’ rains each year causing two peaks in transmission. Estimates of the local EIR were 22–53 in 2003 (1), and 21.7 infective bites per person per year in 2010 (2). Between 2003 and 2011, data were collected on all children (i.e., ≤15 years of age) attending the dispensary.

Demographic surveillance is conducted for the 240,000 people in a 900 square kilometre area in Kilifi County. Four-monthly enumeration rounds were conducted to identify births, deaths, and migration (3). Each inhabitant is described by their family relationships and their homestead of residence, with geospatial coordinates, and assigned a unique personal identifier. These details were used to link children visiting Pingilikani dispensary to geospatial coordinates for the homestead of residence. During enumeration rounds in 2009–2011 ITN use per individual was established during visits to the homestead, as reported by a homestead representative.

We restrict analysis to within an 8 km radius of the dispensary, which accounted for >96% of all visits to the dispensary and excluded visits with specific symptoms such as skin infections or cutaneous abscesses, otitis media, and gastroenteritis (>4 episodes diarrhoea per day) that might have been the primary motivation for seeking health care rather than fever per se. These latter exclusions combined accounted for 14% of all visits.

Malaria diagnosis and treatment

All children presenting for assessment (except those with trauma as their only concern) had finger-prick blood samples examined for malaria parasites. Thick and thin blood smears were stained with 10% Giemsa and examined at x1000 magnification for asexual Plasmodium falciparum parasites. 100 fields were examined before slides could be considered negative. Amodiaquine was the first-line anti-malarial from 2003 to 2005, when policy changed to Co-artemether.

Analysis

Fever was defined as either reported fever by the parents or measured fever, that is, axilliary temperature ≥37.5°C (Mackowiak et al., 1997). The malaria positive fraction (MPF) was calculated as the fraction of febrile children attending the dispensary with fever who were positive for malaria parasites by blood smear examination. MPF was aggregated by homestead. Multiple identifications of fever and parasitaemia in the same child within 21 days were considered a single episode.

The average age of febrile malaria was calculated as the arithmetic mean age at which children visited the dispensary with fever and malaria parasites. Correlations between average age of febrile malaria and MPF per homestead were calculated using spearman's rank correlation coefficient. Grids of gradually increasing cell size were calculated using longitude and latitude coordinates. Simulations were done using the distribution of homesteads identified in our study. We applied a factor to MPF (positive) and average age (negative) to the homesteads within a block of varying size to induce the appearance of clustering at a given spatial scale. Random noise was added to these simulations using a gamma distribution. In the first round of simulations we set the Signal:Noise ratio (i.e., the ratio between the factor applied to MPF and average age vs the mean amplitude of the noise) to reproduce the rs seen in the real data. In the second round of simulations, we varied the Signal:Noise Ratio as shown in individual panels, and in the third round of simulations we introduced a gradient over which the correlation emerged, where the factor applied to MPF and average age was tapered in a uniform way towards 1 beginning at the edge of the simulated block.

Hotspots were defined using SaTScan software to calculate the spatial scan statistic (Kulldorff, 1997). The software is freely available and can be downloaded from www.satscan.org. The version used in this analysis was downloaded in November 2012, as v9.1 for a 64-bit system. The spatial scan statistic uses a scanning window that moves across space. The scanning windows are circles centred on each homestead, with a radius varied from inclusion of only the single homestead it is centred on through to 30% of the population size. When using the Bernoulli model, the software calculates the fraction of cases/controls inside vs outside the each possible scanning window, and selects the window giving the highest probability of a case within the scanning window compared with the probability of a case outside the window. In our application of the Bernoulli model, cases were febrile children with parasitaemia and controls were febrile children without parasitaemia. The test of significance needs to take into account the whole process of selecting the optimal window rather than simply the comparison of inside vs outside the optimal window. This is achieved by running random permutations of the case/control data over the spatial co-ordinates of homesteads and determining the log-likelihood statistic for the model fit by the optimal window for each random permutation. The log-likelihood statistic for the real data is then compared with the statistics on the random permutations to derive a p value. We used 9999 replications in our study. The maximum hotspot size was set at 30% of the population, and the inference level for significance was set at 0.05. The main analysis was done without adjustment for covariates, and a secondary analysis was conducted for the 2009/2010 data with and without ITN use as a covariate. Kernel smoothing with a 1 km radius is used for spatial display graphs, but all analyses of correlation are conducted on raw data without smoothing.

Semivariograms, Moran's I and linear regression models were run in Stata version 12 (StataCorp, Texas). Semivariograms were constructed using 0.1 km intervals between 0.1 km and 10 km. Moran's I was assessed globally using cumulative bands of <0.1, <0.5, <1 and <2 and <5 kms.

Acknowledgements

Peter D Crompton is thanked for helpful comments during manuscript drafting. The manuscript is published with the permission of the Director of KEMRI. PB is jointly funded by the UK Medical Research Council (MRC) and the UK Department for International Development (DFID) under the MRC/DFID Concordat agreement. Work in Pingilikani was funded by the German Research Foundation (DFG, Grant number SFB 544, A7) and by the Wellcome Trust. Bonston Piri and Epson Mwadori are thanked for their contributions in making the geospatial data available. SIH is funded by a Senior Research Fellowship from the Wellcome Trust (095066).

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Funding Information

This paper was supported by the following grants:

  • Wellcome Trust FundRef identification ID: http://dx.doi.org/10.13039/100004440 083579 to Philip Bejon.

  • Medical Research Council (UK) G1002624 to Philip Bejon.

  • UK Department for International Development G1002624 to Philip Bejon.

  • German Research Foundation SFB 544, A7 to Steffen Borrmann.

Additional information

Competing interests

The authors declare that no competing interests exist.

Author contributions

PB, Conception and design, Analysis and interpretation of data, Drafting or revising the article.

PWG, Conception and design, Analysis and interpretation of data, Drafting or revising the article.

BG, Conception and design, Analysis and interpretation of data, Drafting or revising the article.

TNW, Acquisition of data, Drafting or revising the article.

DB, Acquisition of data, Drafting or revising the article.

MO, Acquisition of data, Drafting or revising the article.

JP, Acquisition of data, Drafting or revising the article.

MB, Acquisition of data, Drafting or revising the article.

CN, Acquisition of data, Analysis and interpretation of data.

SIH, Analysis and interpretation of data, Drafting or revising the article.

DLS, Analysis and interpretation of data, Drafting or revising the article.

TB, Conception and design, Acquisition of data, Drafting or revising the article.

EB, Conception and design, Acquisition of data, Drafting or revising the article.

SB, Conception and design, Acquisition of data, Drafting or revising the article.

KM, Conception and design, Drafting or revising the article.

Ethics

Human subjects: Informed consent for participation was obtained, and specific ethical approval was obtained from the KEMRI Ethical Review Committee (SSC Protocol No. 2413: Spatial Epidemiology of Malaria Cases in the Kilifi District Demographic Surveillance Area). The KEMRI ethical review committee required that participants consent for participation in research and for their data to be stored, but does not require a further explicit statement consenting to publication. Our institutional guidelines would require this only in the event that individuals were identifiable in the publication.

References

  1. Bejon P, Williams TN, Liljander A, Noor AM, Wambua J, Ogada E, Olotu A, Osier FH, Hay SI, Färnert A, Marsh K. 2010. Stable and unstable malaria hotspots in longitudinal cohort studies in Kenya. PLOS Medicine 7:e1000304. doi: 10.1371/journal.pmed.1000304 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Bousema T, Drakeley C, Gesase S, Hashim R, Magesa S, Mosha F, Otieno S, Carneiro I, Cox J, Msuya E, Kleinschmidt I, Maxwell C, Greenwood B, Riley E, Sauerwein R, Chandramohan D, Gosling R. 2010. Identification of hot spots of malaria transmission for targeted malaria control. The Journal of Infectious Diseases 201:1764–1774. doi: 10.1086/652456 [DOI] [PubMed] [Google Scholar]
  3. Bousema T, Griffin JT, Sauerwein RW, Smith DL, Churcher TS, Takken W, Ghani A, Drakeley C, Gosling R. 2012. Hitting hotspots: spatial targeting of malaria for control and elimination. PLOS Medicine 9:e1001165. doi: 10.1371/journal.pmed.1001165 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Brooker S, Clarke S, Njagi JK, Polack S, Mugo B, Estambale B, Muchiri E, Magnussen P, Cox J. 2004. Spatial clustering of malaria and associated risk factors during an epidemic in a highland area of western Kenya. Tropical Medicine & International Health: TM & IH 9:757–766. doi: 10.1111/j.1365-3156.2004.01272.x [DOI] [PubMed] [Google Scholar]
  5. Cairns M, Roca-Feltrer A, Garske T, Wilson AL, Diallo D, Milligan PJ, Ghani AC, Greenwood BM. 2012. Estimating the potential public health impact of seasonal malaria chemoprevention in African children. Nature Communications 3:881. doi: 10.1038/ncomms1879 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Coleman M, Mabuza AM, Kok G, Coetzee M, Durrheim DN. 2009. Using the SaTScan method to detect local malaria clusters for guiding malaria control programmes. Malaria Journal 8:68. doi: 10.1186/1475-2875-8-68 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Craig MH, Sharp BL, Mabaso ML, Kleinschmidt I. 2007. Developing a spatial-statistical model and map of historical malaria prevalence in Botswana using a staged variable selection procedure. International Journal of Health Geographics 6:44. doi: 10.1186/1476-072X-6-44 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Dye C, Hasibeder G. 1986. Population dynamics of mosquito-borne disease: effects of flies which bite some people more frequently than others. Transactions of the Royal Society of Tropical Medicine and Hygiene 80:69–77. doi: 10.1016/0035-9203(86)90199-9 [DOI] [PubMed] [Google Scholar]
  9. Ernst KC, Adoka SO, Kowuor DO, Wilson ML, John CC. 2006. Malaria hotspot areas in a highland Kenya site are consistent in epidemic and non-epidemic years and are associated with ecological factors. Malaria Journal 5:78. doi: 10.1186/1475-2875-5-78 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Franckel A, Lalou R. 2009. Health-seeking behaviour for childhood malaria: household dynamics in rural Senegal. Journal of Biosocial Science 41:1–19. doi: 10.1017/S0021932008002885 [DOI] [PubMed] [Google Scholar]
  11. Gaudart J, Poudiougou B, Dicko A, Ranque S, Toure O, Sagara I, Diallo M, Diawara S, Ouattara A, Diakite M, Doumbo OK. 2006. Space-time clustering of childhood malaria at the household level: a dynamic cohort in a Mali village. BMC Public Health 6:286. doi: 10.1186/1471-2458-6-286 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Gemperli A, Sogoba N, Fondjo E, Mabaso M, Bagayoko M, Briët OJ, Anderegg D, Liebe J, Smith T, Vounatsou P. 2006. Mapping malaria transmission in West and Central Africa. Tropical Medicine & International Health: TM & IH 11:1032–1046. doi: 10.1111/j.1365-3156.2006.01640.x [DOI] [PubMed] [Google Scholar]
  13. Gething PW, Patil AP, Smith DL, Guerra CA, Elyazar IR, Johnston GL, Tatem AJ, Hay SI. 2011. A new world malaria map: Plasmodium falciparum endemicity in 2010. Malaria Journal 10:378. doi: 10.1186/1475-2875-10-378 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Ghebreyesus TA, Haile M, Witten KH, Getachew A, Yohannes AM, Yohannes M, Teklehaimanot HD, Lindsay SW, Byass P. 1999. Incidence of malaria among children living near dams in northern Ethiopia: community based incidence survey. BMJ: British Medical Journal 319:663–666. doi: 10.1136/bmj.319.7211.663 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Hay SI, Smith DL, Snow RW. 2008. Measuring malaria endemicity from intense to interrupted transmission. The Lancet Infectious Diseases 8:369–378. doi: 10.1016/S1473-3099(08)70069-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Jensen TP, Bukirwa H, Njama-Meya D, Francis D, Kamya MR, Rosenthal PJ, Dorsey G. 2009. Use of the slide positivity rate to estimate changes in malaria incidence in a cohort of Ugandan children. Malaria Journal 8:213. doi: 10.1186/1475-2875-8-213 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Kleinschmidt I, Omumbo J, Briët O, van de Giesen N, Sogoba N, Mensah NK, Windmeijer P, Moussa M, Teuscher T. 2001a. An empirical malaria distribution map for West Africa. Tropical Medicine & International Health: TM & IH 6:779–786. doi: 10.1046/j.1365-3156.2001.00790.x [DOI] [PubMed] [Google Scholar]
  18. Kleinschmidt I, Sharp BL, Clarke GP, Curtis B, Fraser C. 2001b. Use of generalized linear mixed models in the spatial analysis of small-area malaria incidence rates in Kwazulu Natal, South Africa. American Journal of Epidemiology 153:1213–1221. doi: 10.1093/aje/153.12.1213 [DOI] [PubMed] [Google Scholar]
  19. Kreuels B, Kobbe R, Adjei S, Kreuzberg C, von Reden C, Bäter K, Klug S, Busch W, Adjei O, May J. 2008. Spatial variation of malaria incidence in young children from a geographically homogeneous area with high endemicity. The Journal of Infectious Diseases 197:85–93. doi: 10.1086/524066 [DOI] [PubMed] [Google Scholar]
  20. Kulldorff M. 1997. A spatial-scan statistic. Communications in Statistics: Theory and Methods 26:1481–1496. doi: 10.1080/03610929708831995 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Lim SS, Fullman N, Stokes A, Ravishankar N, Masiye F, Murray CJ, Gakidou E. 2011. Net benefits: a multicountry analysis of observational data examining associations between insecticide-treated mosquito nets and health outcomes. PLOS Medicine 8:e1001091. doi: 10.1371/journal.pmed.1001091 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Lindsay SW, Wilkins HA, Zieler HA, Daly RJ, Petrarca V, Byass P. 1991. Ability of Anopheles gambiae mosquitoes to transmit malaria during the dry and wet seasons in an area of irrigated rice cultivation in the Gambia. The Journal of Tropical Medicine and Hygiene 94:313–324 [PubMed] [Google Scholar]
  23. Loha E, Lunde TM, Lindtjørn B. 2012. Effect of bednets and indoor residual spraying on spatio-temporal clustering of malaria in a village in south Ethiopia: a longitudinal study. PLOS ONE 7:e47354. doi: 10.1371/journal.pone.0047354 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Luxemburger C, Nosten F, Kyle DE, Kiricharoen L, Chongsuphajaisiddhi T, White NJ. 1998. Clinical features cannot predict a diagnosis of malaria or differentiate the infecting species in children living in an area of low transmission. Transactions of the Royal Society of Tropical Medicine and Hygiene 92:45–49. doi: 10.1016/S0035-9203(98)90950-6 [DOI] [PubMed] [Google Scholar]
  25. Mackowiak PA, Bartlett JG, Borden EC, Goldblum SE, Hasday JD, Munford RS, Nasraway SA, Stolley PD, Woodward TE. 1997. Concepts of fever: recent advances and lingering dogma. Clinical Infectious Diseases: an Official Publication of the Infectious Diseases Society of America 25:119–138. doi: 10.1086/514520 [DOI] [PubMed] [Google Scholar]
  26. Midega JT, Smith DL, Olotu A, Mwangangi JM, Nzovu JG, Wambua J, Nyangweso G, Mbogo CM, Christophides GK, Marsh K, Bejon P. 2012. Wind direction and proximity to larval sites determines malaria risk in Kilifi District in Kenya. Nature Communications 3:674. doi: 10.1038/ncomms1672 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Moonen B, Cohen JM, Snow RW, Slutsker L, Drakeley C, Smith DL, Abeyasinghe RR, Rodriguez MH, Maharaj R, Tanner M, Targett G. 2010. Operational strategies to achieve and maintain malaria elimination. Lancet 376:1592–1603. doi: 10.1016/S0140-6736(10)61269-X [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Mwangi TW, Mohammed M, Dayo H, Snow RW, Marsh K. 2005. Clinical algorithms for malaria diagnosis lack utility among people of different age groups. Tropical Medicine & International Health: TM & IH 10:530–536. doi: 10.1111/j.1365-3156.2005.01439.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Noor AM, Clements AC, Gething PW, Moloney G, Borle M, Shewchuk T, Hay SI, Snow RW. 2008. Spatial prediction of Plasmodium falciparum prevalence in Somalia. Malaria Journal 7:159. doi: 10.1186/1475-2875-7-159 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Noor AM, Gething PW, Alegana VA, Patil AP, Hay SI, Muchiri E, Juma E, Snow RW. 2009. The risks of malaria infection in Kenya in 2009. BMC Infectious Diseases 9:180. doi: 10.1186/1471-2334-9-180 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Okiro EA, Al-Taiar A, Reyburn H, Idro R, Berkley JA, Snow RW. 2009. Age patterns of severe paediatric malaria and their relationship to Plasmodium falciparum transmission intensity. Malaria Journal 8:4. doi: 10.1186/1475-2875-8-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Olotu A, Fegan G, Williams TN, Sasi P, Ogada E, Bauni E, Wambua J, Marsh K, Borrmann S, Bejon P. 2010. Defining Clinical Malaria: The Specificity and Incidence of Endpoints from Active and Passive Surveillance of Children in Rural Kenya. PLoS ONE 5:e15569. doi: 10.1371/journal.pone.0015569 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Omumbo JA, Hay SI, Snow RW, Tatem AJ, Rogers DJ. 2005. Modelling malaria risk in East Africa at high-spatial resolution. Tropical Medicine & International Health: TM & IH 10:557–566. doi: 10.1111/j.1365-3156.2005.01424.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Perkins TA, Scott TW, Le Menach A, Smith DL. 2013. Heterogeneity, mixing, and the spatial scales of mosquito-borne pathogen transmission. PLOS Computational Biology 9:e1003327. doi: 10.1371/journal.pcbi.1003327 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Reyburn H, Mbatia R, Drakeley C, Bruce J, Carneiro I, Olomi R, Cox J, Nkya WM, Lemnge M, Greenwood BM, Riley EM. 2005. Association of transmission intensity and age with clinical manifestations and case fatality of severe Plasmodium falciparum malaria. JAMA: the Journal of the American Medical Association 293:1461–1470. doi: 10.1001/jama.293.12.1461 [DOI] [PubMed] [Google Scholar]
  36. Sattler MA, Mtasiwa D, Kiama M, Premji Z, Tanner M, Killeen GF, Lengeler C. 2005. Habitat characterization and spatial distribution of Anopheles sp. mosquito larvae in Dar es Salaam (Tanzania) during an extended dry period. Malaria Journal 4:4. doi: 10.1186/1475-2875-4-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Smith DL, Dushoff J, McKenzie FE. 2004. The risk of a mosquito-borne infection in a heterogeneous environment. PLOS Biology 2:e368. doi: 10.1371/journal.pbio.0020368 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Snow RW, Gouws E, Omumbo J, Rapuoda B, Craig MH, Tanser FC, le Sueur D, Ouma J. 1998. Models to predict the intensity of Plasmodium falciparum transmission: applications to the burden of disease in Kenya. Transactions of the Royal Society of Tropical Medicine and Hygiene 92:601–606. doi: 10.1016/S0035-9203(98)90781-7 [DOI] [PubMed] [Google Scholar]
  39. Snow RW, Molyneux CS, Warn PA, Omumbo J, Nevill CG, Gupta S, Marsh K. 1996. Infant parasite rates and immunoglobulin M seroprevalence as a measure of exposure to Plasmodium falciparum during a randomized controlled trial of insecticide-treated bed nets on the Kenyan coast. The American Journal of Tropical Medicine and Hygiene 55:144–149 [PubMed] [Google Scholar]
  40. Sturrock HJ, Novotny JM, Kunene S, Dlamini S, Zulu Z, Cohen JM, Hsiang MS, Greenhouse B, Gosling RD. 2013. Reactive case detection for malaria elimination: real-life experience from an ongoing program in swaziland. PLOS ONE 8:e63830. doi: 10.1371/journal.pone.0063830 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Sumba PO, Wong SL, Kanzaria HK, Johnson KA, John CC. 2008. Malaria treatment-seeking behaviour and recovery from malaria in a highland area of Kenya. Malaria Journal 7:245. doi: 10.1186/1475-2875-7-245 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Woolhouse ME, Dye C, Etard JF, Smith T, Charlwood JD, Garnett GP, Hagan P, Hii JL, Ndhlovu PD, Quinnell RJ, Watts CH, Chandiwana SK, Anderson RM. 1997. Heterogeneities in the transmission of infectious agents: implications for the design of control programs. Proceedings of the National Academy of Sciences of the United States of America 94:338–342. doi: 10.1073/pnas.94.1.338 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Yeshiwondim AK, Gopal S, Hailemariam AT, Dengela DO, Patel HP. 2009. Spatial analysis of malaria incidence at the village level in areas with unstable transmission in Ethiopia. International Journal of Health Geographics 8:5. doi: 10.1186/1476-072X-8-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Zhou G, Githeko AK, Minakawa N, Yan G. 2010. Community-wide benefits of targeted indoor residual spray for malaria control in the western Kenya highland. Malaria Journal 9:67. doi: 10.1186/1475-2875-9-67 [DOI] [PMC free article] [PubMed] [Google Scholar]
eLife. 2014 Apr 24;3:e02130. doi: 10.7554/eLife.02130.014

Decision letter

Editor: Mercedes Pascual1

eLife posts the editorial decision letter and author response on a selection of the published articles (subject to the approval of the authors). An edited version of the letter sent to the authors after peer review is shown, indicating the substantive concerns or comments; minor concerns are not usually shown. Reviewers have the opportunity to discuss the decision before the letter is sent (see review process). Similarly, the author response typically shows only responses to the major concerns raised by the reviewers.

Thank you for sending your work entitled “The Fractal Micro-epidemiology of Malaria in Coastal Kenya” for consideration at eLife. Your article has been evaluated by a Senior editor, a Reviewing editor, and 2 reviewers. They agreed that the article presents interesting and potentially important findings. They have however a number of methodological concerns and questions regarding the patterns and their interpretation, which would need to be addressed before a decision can be reached.

The Reviewing editor and the other reviewers discussed their comments, and the Reviewing editor has assembled the following comments to help you prepare a revised submission. If you believe you can address these comments within a month, please submit your revision with a description of the changes.

The study draws on the sustained work over many years of an expert team who have documented the malaria parasitaemia status of children presenting to a clinic in rural Kenya. Finding hotpots of malaria transmission is not new, as the authors readily admit, but the phenomenon has rarely been as meticulously examined and analyzed as in this study.

1) The claim on the self-similarity of the patterns and the lack of a characteristic scale needs to be substantiated further for the analyses to be convincing. In particular, it is not clear that the results of the correlation analyses at different spatial resolutions could not arise from the combination of a characteristic scale and the increasing noise from fewer data as the resolution increases. The randomization test addresses the fewer homesteads for a random pattern but not for one with a characteristic scale. It would be valuable to impose a pattern with such a scale on the distribution of homesteads in the landscape and show that this does indeed create a 'shoulder'. Additional motivation and description of the methods is needed: how were the grids constructed? Why would randomly reassigning spatial co-ordinates to homesteads lead to a correlation that declines very rapidly with grid sizes, rather than remaining constant? The choice of analysis to demonstrate hotspots at every scale is unusual in the sense of involving a correlation between two quantities and how this changes with resolution. It is important that the expectations of different kinds of patterns be clear and that the significance of the observed pattern be established.

2) The title and Introduction emphasize the 'fractal' nature of the patterns. This term implies self-similarity in a stronger sense than the lack of an identifiable characteristic scale over some range of resolutions. In this sense, the calculation of a fractal dimension is important but has been relegated here to the Discussion with no actual evidence presented to demonstrate the dimension and the range of scales over which this number was obtained. This plot and the strength of the evidence for a fractal pattern need to be included.

3) The authors need to say where they obtained the SaTScan software, what version (or when downloaded). They also need a more detailed overview of how this method finds hot spots.

4) The possibility of conflation with mean age of infection should be considered. Mean age of infection results from the interaction between infection risk, reporting risk, and the mean age of children present. Age of children present could be an important confounder of the studied relationship: areas with younger children might have higher malaria rates and lower mean age of infection even if there is no signal from the direct causal relationship between them. Ideally, the authors would construct a statistic that measures age in the malarious population relative to the control population at each spatial scale. This issue must at least be carefully discussed.

5) The discussion of ITN use is not entirely convincing. Is there a way to include ITN use in the analysis, and thus really control for it? How much may the use of impregnated bednets be spatially or temporally heterogeneous? The correlation with variance in MPF seems like a red herring, and should be dropped.

6) It would be helpful to readers to know if, within the approximately 16 km diameter of the reported area, there are any obvious geographical differences – e.g., of altitude or ground water/foliage, that might affect malaria transmission. In the Discussion the authors state that 'Accurate characterization of these patterns will inform optimized surveillance and control policies'. Is it possible that such characterization within the study area would have produced results comparable with (and more easily obtained) than the data utilized from clinic records?

7) The investigators have data on the parasitaemia prevalence among children presenting without fever (and without trauma). The suggestion here is not that the authors repeat their analysis using parasitaemia prevalence in the non-febrile but that they discuss whether similar results might have been obtained. Alternatively, were parasite densities measured on blood films, so that the likelihood that a fever is actually due to malaria could be estimated?

8) The practical implications could be written more clearly. The principal suggestion is that “the use of one month cycles of surveillance to target 4 to 8 km diameter hotspots would be optimal” – some expansion on what is meant by “cycles” and “surveillance” - is it recommended that parasitaemia prevalences among febrile children should be the instrument of surveillance? And how frequently should the one-month surveys be carried out?

9) The authors need to be more careful throughout about “negative inference”. The null hypothesis is rarely true, and never confirmed. For example: the wording at the end of the second paragraph of the Introduction is far too strong – the authors should argue that this bias is likely to be small, not that it is unlikely to exist; similarly the “cannot be used” result is presumably a lack of statistical significance and does not imply there is no difference in subjective fever experience (it may imply that this difference is small, highly variable, or both); in the first paragraph of the Results, the authors should say MPF “was not found” to vary ; in the Results section “ITN use and spatial variation in risk”, the distribution was not “random” – the authors found no significant pattern, similarly they did not find a significant correlation with variance. The authors should be aware of this point when addressing the issue of non-malarial fevers, as well.

eLife. 2014 Apr 24;3:e02130. doi: 10.7554/eLife.02130.015

Author response


1) The claim on the self-similarity of the patterns and the lack of a characteristic scale needs to be substantiated further for the analyses to be convincing. In particular, it is not clear that the results of the correlation analyses at different spatial resolutions could not arise from the combination of a characteristic scale and the increasing noise from fewer data as the resolution increases. The randomization test addresses the fewer homesteads for a random pattern but not for one with a characteristic scale. It would be valuable to impose a pattern with such a scale on the distribution of homesteads in the landscape and show that this does indeed create a 'shoulder'. Additional motivation and description of the methods is needed: how were the grids constructed?

We have done such simulations, using a variety of characteristic scales and signal:noise ratios, which are added to our revised manuscript as Supplementary Figures 1 and 2. We conducted these simulations by using the distributions of homesteads observed in the study area, and simulating an increase in malaria positive fraction (MPF) and decrease in age of children with febrile malaria with particular characteristic scale, and additional simulating a degree of random noise using a gamma distribution. We scaled the Signal:Noise ratio in each case to provide a Spearman’s rank correlation coefficient (rs) similar to that observed in our real data. We then analysed these simulated datasets in the same way as we had analysed our real data (i.e. examining the effect of spatial scale at which this correlation occurred by imposing increasingly fine-scale grids on the study area, calculating rs within each cell of the grid, and then estimating the mean rs at each scale of grid).

Interestingly, rather than a simple “shoulder” we see multiple “spikes” in rs as the cell size of the grid varies. By examining the outcomes using different scales of simulated patterns, we found that a “spike” is observed where the scale of the simulated pattern coincides with the scale of the cell size of the superimposed grid (Figure 1–figure supplement 1). Further “spikes” then coincide with points at which the scale of the simulated pattern is a multiple of the grid size used.

We then investigated whether the ratio of signal to noise in the presence of a pattern at a characteristic scale might produce plots similar to those we observed in the real data. The results are shown in Figure 1–figure supplement 2. As expected, we observe that the correlation between age and the malaria positive fraction (i.e., rs) becomes more difficult to discern as the Signal:Noise ratio falls. However, spikes in rs on varying the cell size of the grid can still be noted up until the point at which the Signal:Noise ratio completely obscures any correlation.

Furthermore, we investigated the effect of adding a gradient at the edge of the block of characteristic scale over which the clustering emerges (as a proportion of the block size). The results are shown in Figure 1–figure supplement 3. The presence of a gradient lowers the overall correlation seen, which has the effect of gradually attenuating the “spikes” but not of removing them entirely.

However, we recognise that simulations of noise and gradients are necessarily artificial and real-world data are likely to contain many other more complex sources of variation. We may expect less distinct peaks in such circumstances. Nevertheless these simulations all show marked discontinuities in the correlation between age of malaria and MPF over varying cell size of a superimposed grid. We suggest that a gradual emerging of the correlation between MPF and average age of malaria is not readily explained by clustering at a single spatial scale even in the presence of random variation or gradients.

Taking these findings together with the findings of a) hotspots at multiple spatial scales and b) the findings from the semi-variogram and log-log plot used to calculate a fractal dimension (see response below), we conclude that our findings are indeed the result of spatial clustering of transmission varying consistently at every scale examined.

We have therefore edited the text throughout to read “spikes” instead of “shoulders” and added text to the Results, Discussion, and Methods.

Why would randomly reassigning spatial co-ordinates to homesteads lead to a correlation that declines very rapidly with grid sizes, rather than remaining constant?

Our reading of Figure 1d) is that this is indeed what happens with the random simulation (shown in red). There is some apparent variation in rs below a cell size of 0.5 km, but this is at a point where the confidence intervals of our estimate are quite wide (as a result of having few data points in each cell). It is the correlation in the observed data (shown in blue) that declines rapidly with reducing cell size.

The choice of analysis to demonstrate hotspots at every scale is unusual in the sense of involving a correlation between two quantities and how this changes with resolution. It is important that the expectations of different kinds of patterns be clear and that the significance of the observed pattern be established.

The results from simulations described above demonstrate the expectations with different kinds of pattern, and support the conclusion that that an inverse correlation between age and malaria positive fraction was present at every geographical scale examined within our study. This led us to predict that hotspots will occur at every scale, a conclusion that we empirically tested and confirmed as shown in Figure 2. This latter analysis does not depend on a correlation between two quantities but rather only considers the spatial clustering of febrile malaria cases. Similarly the analysis of the semivariogram, now added as Figure 2–figure supplement 1

(see point 2 below), suggests spatial clustering of febrile malaria cases at every spatial scale examined. Hence our conclusion of spatial clustering at every scale does not depend only on the inverse correlation between age and MPF, but on two further analyses (i.e., the demonstration of hotspots within hotspots in Figure 2, and the semivariogram shown in Figure 2–figure supplement 1), which do not involve a correlation between two variables.

2) The title and Introduction emphasize the 'fractal' nature of the patterns. This term implies self-similarity in a stronger sense than the lack of an identifiable characteristic scale over some range of resolutions. In this sense, the calculation of a fractal dimension is important but has been relegated here to the Discussion with no actual evidence presented to demonstrate the dimension and the range of scales over which this number was obtained. This plot and the strength of the evidence for a fractal pattern need to be included.

We have added the semivariogram plot and log-log semivariogram plot used to calculate the fractal dimension is shown in Figure 2–figure supplement 1 and Figure 2–figure supplement 2. The fractal dimension was calculated from the gradient of this line over the full range of the spatial scale displayed. We have added accompanying text to the manuscript in the Results and Methods.

3) The authors need to say where they obtained the SaTScan software, what version (or when downloaded). They also need a more detailed overview of how this method finds hot spots.

SaTScan software is free to download from the SaTScan website. Further details have been added to the relevant paragraph in the Methods section.

4) The possibility of conflation with mean age of infection should be considered. Mean age of infection results from the interaction between infection risk, reporting risk, and the mean age of children present. Age of children present could be an important confounder of the studied relationship: areas with younger children might have higher malaria rates and lower mean age of infection even if there is no signal from the direct causal relationship between them. Ideally, the authors would construct a statistic that measures age in the malarious population relative to the control population at each spatial scale. This issue must at least be carefully discussed.

The average age of children with non-malarial fever did not show any spatial clustering (Moran’s I=0.01, p=0.5 within 1 km and Moran’s I=0.02, p=0.5 within 5 km) and was not associated with MPF (rs=-0.02, p=0.4). We normalized age of febrile malaria for age of children with non-malarial fever by calculating the absolute difference between the two and found that there was still a negative correlation with MPF (rs=-0.08, p=0.011). However, the correlation is not as strong as that seen without normalization (rs) =-0.16, p<0.0001), partly because the normalization requires data in each homestead to calculate average age for children with and without malaria and therefore observations are dropped (i.e. ∼1,200 of ∼1,500 homesteads), and partly because normalizing according to the average age of non-malarial fever likely adds noise to the statistic. We therefore include the former results in the manuscript as reassurance that the studied relationship between MPF and average age of febrile malaria is unlikely to be confounded by spatial variation in average age of children in the community, but prefer to retain the use of average age of febrile malaria as our metric for the more detailed analysis according to spatial scale presented in the main body of the paper. We have added text to the Results section accordingly.

5) The discussion of ITN use is not entirely convincing. Is there a way to include ITN use in the analysis, and thus really control for it?

ITN use can indeed be included as a covariate in the SaTScan analysis. Using the data from 2009 and 2010, we re-ran the hotspot analysis using SaTScan with and without the inclusion of ITN use as a covariate. The addition of ITN use as a covariate changed the location of the hotspot by 120m, and changed the predicted radius of the hotspot from 5.4 to 5.2km. On re-analysis of the homesteads within the 5.4km hotspot, a further 0.87km hotspot was identified the position and radius of which were not altered by the inclusion of ITN use as a covariate. Finally, within this 0.87km hotspot the same 7 homesteads were identified as a hotspot irrespective of the inclusion of ITN use as a covariate.

How much may the use of impregnated bednets be spatially or temporally heterogeneous?

ITN use showed a random spatial distribution (Moran’s I=0.02, p=0.5). The Spearman’s rank correlation for proportion of homesteads using an ITN between these two years was 0.35 (p<0.0001) indicating that homestead use varied between years. Hence ITN use did not show an auto-correlation indicating non-random spatial heterogeneity, but was temporally heterogeneous. ITNs provided personal protection (i.e. OR=0.69, 95%CI 0.67 to 0.8, p<0.001), but adjusting for them did not appear to alter the location of hotspots (as described above).

The correlation with variance in MPF seems like a red herring, and should be dropped.

Indeed it was not our intention to imply that the correlation with variance in MPF was significant, and we have revised our wording to clarify this, and provide some context to our findings on personal protection of ITNs with regard to previous literature. Taking the three points above together, we have revised the paragraph on ITN use in the Results section.

6) It would be helpful to readers to know if, within the approximately 16 km diameter of the reported area, there are any obvious geographical differences – e.g., of altitude or ground water/foliage, that might affect malaria transmission. In the Discussion the authors state that 'Accurate characterization of these patterns will inform optimized surveillance and control policies'. Is it possible that such characterization within the study area would have produced results comparable with (and more easily obtained) than the data utilized from clinic records?

There is only modest variation in altitude within the study area, from 30 to 180 metres above sea level (IQR 49-99 metres). Nevertheless, regarding the general point raised there is a previous literature that we now reference. Malaria transmission has been shown to vary by geographical features such as altitude (Reyburn et al), cultivation practices (Lindsay et al), streams and dams (Ghebreyesus et al). Ecological models have been developed using frequentist techniques (Omumbo et al), Bayesian approaches (Craig et al) and fuzzy logic (Snow et al). However, the same ecological factor may act inconsistently in different geographical areas (Gemperli et al, Noor et al, Kleinschmidt et al). Hence although there is obvious scope for analysis of our data in relation to geographical features, we believe this would be an appropriate topic for a further publication rather than an additional analysis to include here. On the other hand, there are some general points that we add to the discussion in response.

We identify a fractal pattern with clustering at multiple geographical scales. It therefore follows that in order to develop a model based on geographical features one would either need to identify a key feature that can be mapped on multiple geographical scales, or a combination of geographical features that can exist across multiple scales. For instance, we note a river in the Eastern part of the study area running North to South, and the main hotspot is associated with this river (Figure 2a). However, the hotspot is associated with the Southern part but not the Northern part of the river, for reasons that are unclear to us. Furthermore, within the main hotspot there is further variation in risk (Figure 2b) that does not appear to depend on distance from the river, but is located at a particular location along the river, with lower risk at other points along the river. Hence we reason that it is unlikely that any single geographical feature predicts a substantial portion of the variation in malaria risk, and any attempt to develop a geographical model needs to take into account the fractal pattern of spatial clustering. We summarize these considerations in a revised version of our final paragraph in the Discussion.

7) The investigators have data on the parasitaemia prevalence among children presenting without fever (and without trauma). The suggestion here is not that the authors repeat their analysis using parasitaemia prevalence in the non-febrile but that they discuss whether similar results might have been obtained. Alternatively, were parasite densities measured on blood films, so that the likelihood that a fever is actually due to malaria could be estimated?

An analysis of asymptomatic parasitaemia would likely yield different results from an analysis of febrile parasitaemia, as we demonstrated in a previous study (Bejon et al., 2010 “Stable and unstable malaria hotspots in longitudinal cohort studies in Kenya”). However we do not have microscopy samples from an appropriate group in Pingilikani to limit analysis to asymptomatic parasitaemia. Samples from children attending with trauma have been taken as indicators of asymptomatic parasitaemia in the community, but as the reviewer notes samples from children with trauma were not examined. Children presenting without objective fever at the point of sampling in the dispensary are nevertheless presenting with an acute illness, and hence may not be very different from the objectively febrile group. We have shown in previous analysis of parasite densities from children presenting to the dispensary compared with parasite densities from cross-sectional surveys in the community that the malaria attributable fractions are substantial even among children without measured fever attending the dispensary (i.e. 40% (95%CI 39-41%), compared with 75% (95%CI 74-76%) Olotu et al 2011, “Defining Clinical Malaria: The Specificity and Incidence of Endpoints from Active and Passive Surveillance of Children in Rural Kenya”).

We do have parasite densities for the blood films here, but since we do not have appropriately matched community based sampling we cannot repeat a spatially explicit version of the malaria attributable fraction modelling in order to determine the likelihood that fever is actually due to malaria. Therefore we add to the Discussion accordingly.

8) The practical implications could be written more clearly. The principal suggestion is that “the use of one month cycles of surveillance to target 4 to 8 km diameter hotspots would be optimal” – some expansion on what is meant by “cycles” and “surveillance” - is it recommended that parasitaemia prevalences among febrile children should be the instrument of surveillance? And how frequently should the one-month surveys be carried out?

We have re-written this section for clarity. We did indeed intend that monitoring parasitaemia among febrile children would be the instrument of surveillance. We intended that this would be done in cycles, with one month of monitoring, followed by one month of targeted intervention (during which parallel monitoring could take place in order to plan the following months targeting). These could in theory be undertaken throughout the year, or to conserve resources could be undertaken during the rainy season. We have revised the Discussion accordingly.

9) The authors need to be more careful throughout about “negative inference”. The null hypothesis is rarely true, and never confirmed. For example: the wording at the end of the second paragraph of the Introduction is far too strong – the authors should argue that this bias is likely to be small, not that it is unlikely to exist; similarly the “cannot be used” result is presumably a lack of statistical significance and does not imply there is no difference in subjective fever experience (it may imply that this difference is small, highly variable, or both); in the first paragraph of the Results, the authors should say MPF “was not found” to vary ; in the Results section “ITN use and spatial variation in risk”, the distribution was not “random” – the authors found no significant pattern, similarly they did not find a significant correlation with variance. The authors should be aware of this point when addressing the issue of non-malarial fevers, as well.

We accept this point and the necessary edits have been made throughout.


Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

RESOURCES