Abstract
Background
Robust household sampling, commonly applied for population-based investigations, requires sampling frames or household lists to minimize selection bias. We have applied Google Earth Pro satellite imagery to constitute structure-based sampling frames at sites in Pikine, Senegal; Pietermaritzburg, South Africa; and Wad-Medani, Sudan. Here we present our experiences in using this approach and findings from assessing its applicability by determining positional accuracy.
Methods
Printouts of satellite imagery combined with Global Positioning System receivers were used to locate and to verify the locations of sample structures (simple random selection; weighted-stratified sampling). Positional accuracy was assessed by study site and administrative subareas by calculating normalized distances (meters) between coordinates taken from the sampling frame and on the ground using receivers. A higher accuracy in conjunction with smaller distances was assumed. Kruskal-Wallis and Dunn multiple pairwise comparisons were performed to evaluate positional accuracy by setting and by individual surveyor in Pietermaritzburg.
Results
The median normalized distances and interquartile ranges were 0.05 and 0.03–0.08 in Pikine, 0.09 and 0.05–0.19 in Pietermaritzburg, and 0.05 and 0.00–0.10 in Wad-Medani, respectively. Root mean square errors were 0.08 in Pikine, 0.42 in Pietermaritzburg, and 0.17 in Wad-Medani. Kruskal-Wallis and Dunn comparisons indicated significant differences by low- and high-density setting and interviewers who performed the presented approach with high accuracy compared to interviewers with poor accuracy.
Conclusions
The geospatial approach presented minimizes systematic errors and increases robustness and representativeness of a sample. However, the findings imply that this approach may not be applicable at all sites and settings; its success also depends on skills of surveyors working with aerial data. Methodological modifications are required, especially for resource-challenged sites that may be affected by constraints in data availability and area size.
Keywords: satellite imagery, geospatial sampling frame, positional accuracy, sub-Saharan Africa
Household sampling is a common and resource-efficient method for rapid, in-depth investigations in population-based studies. This includes assessment of health indicators, morbidity and mortality rates, vaccination coverage, healthcare behavior, and relevant sociodemographic, socioeconomic, and ecological information among the population of interest [1–3]. Data derived from a representative sample allow inference findings to an entire population under investigation [1–4]. A comprehensive sampling frame or household list largely reduces the possibility of selection bias. Ideally, a sampling frame is available through a demographic surveillance system (DSS), which longitudinally records demographic and vital statistics of individuals [5, 6]. However, DSSs are limited to distinct sites and their implementation and maintenance require long-term planning and enduring financial support [6].
Sampling procedures in resource-limited settings inherently are affected by distinct drawbacks, such as scarcity of demographic and geographic data and lack of a comprehensive sampling frame. There are, however, several options to cope with the absence of a household list by using convenience samples [7], choosing a random starting point, and selecting sampling units by applying systematic sampling [8–10]. Moreover, chain or respondent-driven sampling may be applied [11, 12]. In such a design, study subjects recruit additional subjects until a desired sample size is achieved. Further approaches include segment sampling, where subunits in a study area are divided into segments and all sampling units of a selected segment are enrolled [13], mosaic formation or rasterizing with continuous geographic data of a study area transformed into a raster sampling frame from which sampling units are drawn [14], or overlaying study areas with grid cells to establish a sampling frame [15–17]. Geographical information systems (GISs) are also used if household lists are not available [2, 4, 18]. Kondo et al [2] have described spatial sampling with modifications for postdisaster scenarios and ecological, environmental, and social studies. Satellite imagery has been applied to generate sampling frames for research on mosquito-borne diseases [19] and the impact resulting from natural and humanitarian crises in addition to healthcare behavior [20].
We have applied satellite imagery to establish sampling frames to assess healthcare behavior at three resource-limited sites (Table 1) of the Typhoid Fever Surveillance in Africa Program (TSAP) [21]. The TSAP network investigated the incidence of Salmonella infections at thirteen sentinel sites in ten countries (Burkina Faso, Ethiopia, Ghana, Guinea-Bissau, Kenya, Madagascar, Senegal, South Africa, Sudan, Tanzania) during the period 2010–2014 [22]. Satellite maps combined with Global Positioning System (GPS) receivers were used to locate and to verify the locations of selected structures on the ground. We present our experiences of this approach and findings from assessing its applicability by determining positional accuracy. Findings on the assessment of healthcare behavior within the TSAP study were presented elsewhere [23].
Table 1.
Country | Senegal | South Africa | Sudan | |||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Site | Pikine | Pietermaritzburg | Wad-Medani | |||||||||||||
Study period | September 2012–January 2013 | September–December 2013 | August–September 2013 | |||||||||||||
Google Earth Pro | ||||||||||||||||
Date of satellite imagery update(s) | March, June, October, November, December 2012 | March, May, July 2013 | March, September, October 2013 | |||||||||||||
Date satellite imagery used | June 2012 | July 2013 | March 2013 | |||||||||||||
Elapsed time between imagery used and study conduct | 2–3 mo | 1–2 mo | 4–5 mo | |||||||||||||
Setting | Urban to semiurban | Urban to semiurban | Urban to semiurban | |||||||||||||
Area, km2 | 7.98 | 343.56 | 6.47 | |||||||||||||
Administrative subunits | 6 | 22 | 10 | |||||||||||||
Population | 342 178 (2012: [22–24, 30]) | 361 582 (2011: [27–29]) | 48 000 (2012: [25, 26, 33]) | |||||||||||||
Population density per km2 | 42 886 | 1052 | 7409 | |||||||||||||
Population density per km2, by administrative subunit | AdSub | Density | AdSub | Density | AdSub | Density | AdSub | Density | AdSub | Density | AdSub | Density | AdSub | Density | AdSub | Density |
1 | 75 999 | 4 | 49 361 | 1 | 1386 | 9 | 445 | 17 | 4292 | 1 | 22 848 | 5 | 2994 | 9 | 2727 | |
… | … | … | … | 2 | 1070 | 10 | 3634 | 18 | 6875 | … | … | … | … | … | … | |
… | … | … | … | 3 | 423 | 11 | 662 | 19 | 1960 | 2 | 15 810 | 6 | 12 370 | 10 | 26 293 | |
2 | 60 800 | 5 | 28 217 | 4 | 409 | 12 | 2298 | 20 | 6147 | … | … | … | … | … | … | |
… | … | … | … | 5 | 448 | 13 | 3472 | 21 | 4823 | 3 | 81 370 | 7 | 11 425 | … | … | |
… | … | … | … | 6 | 670 | 14 | 1203 | 22 | 2246 | … | … | … | … | … | … | |
3 | 18 000 | 6 | 44 081 | 7 | 536 | 15 | 4607 | … | … | 4 | 10 218 | 8 | 1911 | … | … | |
… | … | … | … | 8 | 677 | 16 | 6049 | … | … | … | … | … | … | … | … | |
No. of enumerated structures | 45 510 | 100 439 | 32 905 | |||||||||||||
Structure density per km2 | 5829 | 292 | 5086 | |||||||||||||
Structure density per km2, by administrative subunit | AdSub | Density | AdSub | Density | AdSub | Density | AdSub | Density | AdSub | Density | AdSub | Density | AdSub | Density | AdSub | Density |
1 | 2087 | 4 | 5675 | 1 | 385 | 9 | 124 | 17 | 1192 | 1 | 2195 | 5 | 3183 | 9 | 9096 | |
… | … | … | … | 2 | 297 | 10 | 1009 | 18 | 1910 | … | … | … | … | … | … | |
… | … | … | … | 3 | 117 | 11 | 184 | 19 | 544 | 2 | 4409 | 6 | 998 | 10 | 8158 | |
2 | 3207 | 5 | 5197 | 4 | 114 | 12 | 638 | 20 | 1708 | … | … | … | … | … | … | |
… | … | … | … | 5 | 125 | 13 | 964 | 21 | 1340 | 3 | 3833 | 7 | 4707 | … | … | |
… | … | … | … | 6 | 186 | 14 | 334 | 22 | 624 | … | … | … | … | … | … | |
3 | 2406 | 6 | 11 591 | 7 | 149 | 15 | 1280 | … | … | 4 | 2096 | 8 | 6072 | … | … | |
… | … | … | … | 8 | 188 | 16 | 1680 | … | … | … | … | … | … | … | … |
Abbreviation: AdSub, administrative subunit.
MATERIALS AND METHODS
The study was implemented at sites in Pikine, Senegal (September 2012–January 2013), Pietermaritzburg, South Africa (September–December 2013), and Wad-Medani, Sudan (August–September 2013). It was approved by the ethics committees of the collaborating institutions and the ethical review board of the International Vaccine Institute.
Study Sites
Details of the selection of study sites have been described previously [21, 23]. In brief, the sites were chosen based on reports on human Salmonella infections, an infrastructure suitable for the surveillance of acute febrile conditions, and access to healthcare [22]. Pikine is a semiurban region in the east of Senegal’s capital, Dakar. Pietermaritzburg is the capital of the KwaZulu-Natal Province and located at the southeastern coast of South Africa. Wad-Medani is southeast of Sudan’s capital Khartoum and the capital of the Al Jazirah State.
Sites were highly diverse with respect to population and study area sizes, including administrative subunits, or AdSubs (Table 1). They varied in topography and vegetation, the road network, and formal and informal settlements (Supplementary Figure 1A–C). Across all sites, single-story and single-family households were common compared to multistory and multifamily households. A household was defined as a person or a group of related or unrelated persons living in the same dwelling unit, acknowledging one adult individual as household head, sharing the same housekeeping arrangements, and independently procuring food and other essentials for living [23].
Population sizes and boundaries of each site and its AdSubs were determined by combining different sources. Among them were up-to-date demographic information, population summary figures and growth rates [24–32], records of healthcare facilities [23], and administrative and geographic data, including sketch-maps [31, 33–37]. Boundaries of a study area and its AdSubs were transferred from sketch maps, or the sketch maps were digitized and superimposed onto Google Earth Pro imagery (version 6.2.2.6613; Google, Mountain View, California).
Sampling Frame
We used the latest satellite imagery available from Google Earth Pro (Pikine: June 2012; Pietermaritzburg: July 2013; Wad-Medani: March 2013; Table 1). Every single-standing structure defined as an edifice not connected to another structure in the respective satellite image, of appropriate size and rectangular or square shape was enumerated by positioning a placemark at its approximate center. Irregularities in size, shape, and spacing between structures made manual enumeration preferable (Supplementary Figure 1A–C). Buildings of nonresidential character based on best local knowledge were not enumerated. Figure 1 shows the sampling frame in Pikine. Sampling frame data were exported from Google Earth Pro and imported into ArcGIS version 10.2 (Esri, Redlands, California) to assign a unique identifier and coordinates to each placemark.
Random Sample Selection
The minimum number of household interviews required (N0) was 492 at each site. Sample size calculations were based on the Cochran formula for categorical data; their assumptions (95% confidence interval, 5% precision, 2.0 design effect) have been described previously [23]. N0 was distributed by applying weighted-stratified sampling and selected from a sampling frame by serial simple random selection using Microsoft Visual FoxPro (version 9.0; Microsoft, Redmond, Washington; Figure 2). Approximately 10%–20% replacement structures by AdSub (Figure 2) were chosen using the same selection procedure as for N0 if a sample structure was a nonresidential building, eligible respondents were not available after three consecutive visits, or participation was refused.
Structure Identification
The study teams were comprised of community nurses and healthcare workers experienced in conducting household surveys (Pikine: eight interviewers; Wad-Medani: eight interviewers; Pietermaritzburg: twenty interviewers). They were trained on all study procedures and, in particular, on the localization and the verification of locations of selected structures by using satellite maps and GPS receivers as not all interviewers were familiar with observing study areas from an aerial perspective. The assignment of AdSubs and sample structures to the surveyors was done arbitrarily across all sites.
Poster-sized (60 × 60 cm to 60 × 90 cm) printouts of Google Earth Pro satellite imagery with high resolution (approximately 500–600 m “eye altitude” in high-density and approximately 1700–1800 m “eye altitude” in low-density settings; Table 1 and Supplementary Figure 1A–C) were prepared to depict selected structures flagged and labeled with the respective identifiers (Figure 2). “Eye altitude” is a term used in Google Earth for viewing heights. GPS receivers (Garmin-eTrex, GPS accuracy <10 m; differential GPS [Wide Area Augmentation System], accuracy 3 m; 12-channel receiver; Garmin Ltd, Lenexa, Kansas) set in World Geodetic System 1984 were used to verify locations of sample points. Once a structure was identified on the ground with reference to a landmark, interviewers took the GPS readings allowing sufficient time to obtain satellite signals with an accuracy of ≤ three meters. The GPS receiver was positioned to the structure as close as possible, in a static position, and an open area, strictly avoiding tree cover, roof cover, balconies, or verandas to ensure barrier-free reading, reduction of interferences, and increased accuracy. A structure was replaced if it could not be identified correctly.
Positional Accuracy
We have evaluated the applicability of the described approach by determining the positional accuracy of sample points. This was performed by analyzing distances (meters) between geographic coordinates taken from the sampling frame and on the ground by receivers, assuming zero distances and approximation of earths radius based on the Pythagorean theorem [7]. We surmised that the smaller the distances, the more accurately structures were identified and, thereby, the more representative were our samples by AdSubs and sites. The computed distances were normalized for improved comparability (Supplementary Table 1). Microsoft Office Excel version 2010, expanded with Excel add-on tools, was applied for all calculations. The equations are explained in Supplementary Table 1.
Distances by each site and AdSub were assessed by generating medians, interquartile ranges (IQRs), and quartiles displayed by box plots, and root mean square errors (RMSEs) [38, 39]. Obtained distances were categorized into quintiles and graded correspondingly as very good (lowest quintile), good, fair, poor, and very poor (highest quintile). Furthermore, distances by AdSub of Pietermaritzburg only were classified into tertiles based on population and structure density (Supplementary Figure 2) and graded as low, medium, and high; medians, IQR, and quartiles by grade or setting are given in box plots. The positional accuracy of the described approach was also assessed for each interviewer of Pietermaritzburg by computing medians, IQRs, and quartiles displayed by box plots, RMSEs, and quintiles of distances, followed by grading. In addition, the nonparametric rank-based Kruskal-Wallis test was performed to evaluate distance differences by setting and interviewer of Pietermaritzburg. The test was followed by Dunn multiple pairwise comparisons, assuming rejection of the null hypothesis (H0) of no difference in the distribution of distances by setting or interviewer. Bonferroni correction was applied to compensate for incorrectly rejecting the H0 [40, 41].
RESULTS
Sampled Structures
The number of enumerated structures constituting the sampling frame was 45 510 in Pikine (Figure 1), 100 439 in Pietermaritzburg, and 32 905 in Wad-Medani. The enumeration and preparation of the satellite maps, including random selection and visualization of selected sample points using Google Earth Pro, took approximately two weeks per site.
In Pikine, 597 structures were included into the study, of which 495 (83%) were initially selected and 101 (17%) were substituted. In Pietermaritzburg, 2402 sample points were identified of which 461 (19%) were replaced; of those replaced, 164 (36%) were nonresidential, 143 (31%) were excluded due to refusal in participation, and in 154 (33%) household members were not addressable after 3 consecutive visits. A total of 549 structures were enrolled in Wad-Medani, of which 412 (75%) were initially selected and 137 (25%) were replaced.
Accuracy of Sample Identification
The median normalized distances (formula, refer to Supplementary Table 1) and RMSEs were 0.05 (IQR, 0.03–0.08) and 0.08, respectively, in Pikine; 0.09 (IQR, 0.05–0.19) and 0.42 in Pietermaritzburg; and 0.05 (IQR, 0.00–0.10) and 0.17 in Wad-Medani. Quartiles of distances by AdSub of each site, including RMSEs, are displayed in Figure 3. The quintile categorization by site and AdSub revealed largest proportions in the categories “good” and “fair” in Pikine, “poor” and “very poor” in Pietermaritzburg, and “very good” and “fair” in Wad-Medani (Figure 4); 4 of Pietermaritzburg’s AdSubs [5, 7, 16, 17] with the largest proportions in the categories “very good” and “good” showed also low median values, IQRs, and RMSEs.
The median distances and RMSEs were 0.09 (IQR, 0.05–0.18) and 0.40 in the low-density, 0.11 (IQR, 0.05–0.23) and 0.37 in the medium-density, and 0.09 (IQR 0.00–0.15) and 0.47 in the high-density setting of Pietermaritzburg; quartiles of distances by AdSub and setting are illustrated in Supplementary Figure 2. The quintile categorization revealed largest proportions in the categories “poor” and “very poor” in the low- and medium-density settings, and “very good” and “poor” in the high-density setting. An Hc (2) = 5.991 with P < .0001 (Hc_observational = 28.202 ≥ Hc_critical = 5.991; rejection of H0) was found by using Kruskal-Wallis test. The two-tailed P values of Dunn multiple pairwise comparisons (Bonferroni significance level = .0167) indicated significant differences between the low- and medium-density settings (P = .0007) as well as the medium- and high-density (P = .0001) settings.
Quartiles of distances by interviewers of Pietermaritzburg are displayed in Supplementary Figure 3, including the RMSEs. The quintile categorization is depicted in Supplementary Figure 4. Largest proportions were found in the categories “very good” and “good” for one-third of the interviewers (interviewers 3, 6, 8, 10, 15, and 16) in addition to small median values, narrow IQRs, and low RMSEs. In contrast, the largest proportions were seen in the categories “poor” and “very poor” for one-fifth of interviewers (interviewers 5, 14, 18, and 19) besides large median values, wide IQRs, and high RMSEs. The Kruskal-Wallis test revealed an Hc (19) = 30.144 with P < .0001 (Hc_observational = 239.317 ≥ Hc_critical = 30.144; rejection of H0); P values of Dunn comparisons by interviewer are given in Supplementary Table 2.
DISCUSSION
The application of satellite imagery is an efficient tool as also shown by other research conducted, even for sampling frames of varying sizes and in diverse sites as seen in our multicountry study. While we have chosen this particular approach for conducting household surveys in our TSAP study sites, we did not conduct a head-to-head comparison to other methods and, hence, can only report here on the experiences we made. With regard to its applicability we observed here, the evaluation of the described geospatial approach revealed a lower accuracy for Pietermaritzburg and a higher accuracy for Pikine and Wad-Medani in the identification of structures. This finding is based on largest median values, IQRs, RMSEs, and outliers of distances (Figure 3), as well as greatest proportions in the high and highest quintile categories (Figure 4) observed for Pietermaritzburg.
Though the analyses revealed only minor discrepancies in the statistical parameters (median, IQR, RMSE, outliers, quintile categorization) by setting of Pietermaritzburg, Kruskal-Wallis and Dunn tests indicated significant differences in distances between low- and medium-density and medium- and high-density settings. This suggests that the approach presented could be followed more accurately in the low- and high-density settings, supporting findings of previous studies [2]. Moreover, this result may explain why the geospatial method was more successful in Pikine and Wad-Medani, both settings of high population density (Table 1). However, further investigations are required to assess if this is a real finding or just a result caused by an almost one-third smaller sample size in the medium-density setting of Pietermaritzburg. Future investigations should also include thorough research on drivers of accuracy. However, since we performed a retrospective assessment of our geospatial approach, this is not available, which is a major limitation.
Interviewers in Pietermaritzburg performed the presented approach with varying exactness, corroborated by the interrelation observed between the quintile categorization, medians, IQRs, and RMSEs by interviewers. For interviewers yielding the largest proportions in the lowest and low quintiles, small median values, IQRs, and RMSEs were observed. In contrast, if largest proportions were found in the high and highest quintiles, large median values and wide IQRs and high RMSEs were seen. Kruskal-Wallis and Dunn tests revealed statistically significant differences for those interviewers showing high accuracy compared to interviewers with poor accuracy in carrying out this approach and vice versa. However, we cannot provide a plausible explanation for this finding. A much larger team of interviewers might have resulted in a general poorer success.
A limitation is that we could not directly compare our rigorous method with alternatives as we have no quantifiable data on whether the method utilized in the TSAP program justifies the extra work and cost compared to other methods. A pitfall of satellite imagery is the lack of structure differentiation into residential or nonresidential from the aerial perspective. In fact, this applied to all sites [2, 4, 20]. The need of replacing structures may increase the sample size and prolong the study conduct. Incorrect identification of terrestrial sample points is a further weakness that applied to all sites, in particular in areas where buildings were not lined up as described in an earlier study [2], clustered, or interlaced. The latter problems were likely caused by interfering factors like reflective materials (water, metallic objects), obstructive buildings, and environmental diversity influencing the accuracy of GPS readings, and resulting in positional errors (multipath effect) [7, 38, 39, 42]. A further limitation is that there may have been some degree of inaccuracy in obtaining GPS readings of selected sample points in the field due to the intrinsic inaccuracy, which we have not adjusted for in the analysis. Structures were enumerated by positioning a placemark at the approximate center, whereas GPS readings on the ground were performed simply as close as possible to structures.
Nevertheless, the use of satellite maps was beneficial across all sites. They guided interviewers on the ground, facilitated identification of structures, and allowed the recording of the daily study progress. This is in accordance with previous observations [20, 43]. Geospatial approaches relying on computerized random selection of sample points was applied across all three sites. This assures that each point has the same probability of being chosen and thus increases the degree of randomness of samples selected. It also minimizes the possibility of introducing selection bias by study staff by allowing a prioritized sample selection in densely inhabited areas, or near a study area’s center or random starting point for instance, and a nonprioritized selection of structures in remote areas [2, 7]. We believe that this makes spatial sampling frames superior to other techniques that use no or inappropriate sampling frames such as convenience sampling [7], systematic sampling [8, 9] or chain or respondent-driven sampling [11, 12], and which rely on a homogeneous study population as applied in segment sampling [13]. An advantage of manual enumeration as performed across our sites overautomated techniques such as mosaic formation, rasterizing [14], or grid cells [15–17] is that it is particularly suitable if buildings are irregular in size, shape, and spacing.
CONCLUSIONS
The application of satellite imagery offers a broad spectrum for research and has been deployed in the TSAP program. Actual healthcare utilization data are part of the manuscripts that were published previously [22, 23]. The evaluation of this approach conducted in a comparable and standardized manner indicates that sources of selection bias are reduced and robustness is increased, if performed with high accuracy. However, our findings imply that the applicability of this geospatial approach may not be suitable for all sites and settings, in particular not for medium-density settings due to an overall poorer study success. They also indicate that the skills of staff working with aerial data considerably affect the outcome of this approach, as seen for Pietermaritzburg. A potential modification of the method may be to apply ArcGIS tools to randomly spread geographic points across study areas based on a required sample size and the sampling strategy instead of enumerating individual structures. This needs to be examined in other resource-limited sites that are even more affected by data availability and constraints in area size than those selected for the present study.
Supplementary Data
Supplementary materials are available at Clinical Infectious Diseases online. Consisting of data provided by the authors to benefit the reader, the posted materials are not copyedited and are the sole responsibility of the authors, so questions or comments should be addressed to the corresponding author.
Notes
Acknowledgments. The authors thank Soo-Young Kwon and Hyon Jin Jeon of the International Vaccine Institute (IVI) for supporting the Typhoid Fever Surveillance in Africa Program (TSAP) in administrative matters. We are grateful for the exceptional efforts of all contributors at the Institut Pasteur de Dakar and associated institutes in Senegal, which made this research possible: household visits/interviewers (Institut Pasteur de Dakar: Adama Dieng, Aliou Faye, Andulaye Diop, Coumba Thiaw, Khadiatou Diop, Mamadou Faye, Majeur Cissé, Souane Moussa); data management (Institut Pasteur de Dakar: Atta Gadji Kandji, Joseph Faye); and scientific support (Diagne Fallou and Lo Ibrahima, Responsable du commune d’arrondisssment de Pikine Ouest; Djibril Diallo Institut Sénégalais de Recherches Agricoles (ISRA), Unité d’Information et de Valorisation (UNIVAL), and Responsable du commune d’arrondisssment de Djiddah Thiaroye Kao; Thiaroye Kao, Unité d’Information et de Valorisation, Responsable du commune d’arrondisssment de Djiddah; Marème Dia Ndiaye, République du Sénégal/Ministère de la Santé et de la Prévention; Souleymane Diop, Centre de Suivi Ecoloquige; Papa Djibril Ba, République du Sénégal/Ministère de l’Economique et des Finances, Agence Nationale de la Statistique et de la Démographie). The authors also thank all contributors at the University of Gezira and the Surgical Pediatric Hospital in Wad-Medani, Sudan, for the exceptional efforts, which allowed the conduct of this research: household visits/interviewers (Iman Gasmelseed, Mohammed Omer, Niama Mubarak, Nuseiba Abdalla, Omnia Adam, Rajaa Bahlol, Rasha Khidir, Suhair Haydar). Finally, the authors are grateful for the exceptional efforts of all contributors at the National Institute for Communicable Diseases (NICD) as well as the Red Cross in South Africa for their outstanding efforts, which made this research possible: household visits/interviewers (Red Cross: Annetta Ndlovu, Bukeka Nzimande, Dumisani Mzobg, Jabu Gumede, Happiness Mthembu, Hlengiwe Ngcobo, Montsikelele Nkwanyana, Ncamsile Yengwa, Nonkululeko Mettunu, Nontobeko Mknize, Sandisa Ndzimela, Sandra Majozi, Peggy Hlubi, Pinky Thabethe, Sbongile Phetha, Siyabonga Hlatshwayo, Sizwe Skosana, Xolani Zulu, Zandile Ndhlovu, Zanele Mncube); data management (NICD: Makatisane Papo, Jabu Mabuyakhulu); administrative support (NICD: Sibongile Walaza); support in study coordination (Red Cross: Derick Naidoo, Zinny Zulu); and scientific support (NICD: Adam Cohen, Cheryl Cohen, Claire von Mollendorf). The study was approved by the respective ethics committees, the Comité National d’Ethique pour la Recherche en Santé in Senegal, the KwaZulu-Natal Provincial Health Research Committee (PHRC) and the Human Research Ethics Committee (Medical) in South Africa, the National Health Research Ethics Committee in Sudan, and the Ethical Review Board of the IVI. Written informed consent was obtained from all study participants.
Disclaimer. The findings and conclusions in this report are those of the authors and do not necessarily represent the official position of the Bill & Melinda Gates Foundation, the University of Cambridge, or IVI.
Financial support. This research was funded by the Bill & Melinda Gates Foundation [OPP1127988]. The International Vaccine Institute acknowledges its donors including the Republic of Korea and the Swedish International Development Cooperation Agency. This publication was made possible through a grant from the Bill & Melinda Gates Foundation [OPP1201031].
Supplement sponsorship. This article was published as part of the supplement “Severe Typhoid Fever in Africa (SETA) Program” sponsored by the International Vaccine Institute.
Potential conflicts of interest. The authors report no potential conflicts of interest. All authors have submitted the ICMJE Form for Disclosure of Potential Conflicts of Interest. Conflicts that the editors consider relevant to the content of the manuscript have been disclosed.
References
- 1. Ansah EK, Powell-Jackson T. Can we trust measures of healthcare utilization from household surveys? BMC Public Health 2013; 13:853. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Kondo MC, Bream KD, Barg FK, Branas CC. A random spatial sampling method in a rural developing nation. BMC Public Health 2014; 14:338. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. United Nations. Designing household survey samples: practical guidelines 2008. Available at: https://unstats.un.org/unsd/demographic/sources/surveys/Series_F98en.pdf. Accessed 25 December 2018.
- 4. Escamilla V, Emch M, Dandalo L, Miller WC, Martinson F, Hoffman I. Sampling at community level by using satellite imagery and geographical analysis. Bull World Health Organ 2014; 92:690–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Adazu K, Lindblade KA, Rosen DH, et al. . Health and demographic surveillance in rural western Kenya: a platform for evaluating interventions to reduce morbidity and mortality from infectious diseases. Am J Trop Med Hyg 2005; 73:1151–8. [PubMed] [Google Scholar]
- 6. Ye Y, Wamukoya M, Ezeh A, Emina JB, Sankoh O. Health and demographic surveillance systems: a step towards full civil registration and vital statistics system in sub-Sahara Africa? BMC Public Health 2012; 12:741. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Siri JG, Lindblade KA, Rosen DH, et al. . A census-weighted, spatially-stratified household sampling strategy for urban malaria epidemiology. Malar J 2008; 7:39. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Bostoen K, Chalabi Z. Optimization of household survey sampling without sample frames. Int J Epidemiol 2006; 35:751–5. [DOI] [PubMed] [Google Scholar]
- 9. Bennett S, Woods T, Liyanage WM, Smith DL. A simplified general method for cluster-sample surveys of health in developing countries. World Health Stat Q 1991; 44:98–106. [PubMed] [Google Scholar]
- 10. World Health Organization. Vaccination coverage cluster surveys: reference manual.2015. Available at: https://www.who.int/immunization/monitoring_surveillance/Vaccination_coverage_cluster_survey_with_annexes.pdf. Accessed 3 June 2019.
- 11. Emery S, Lee J, Curry SJ, et al. . Finding needles in a haystack: a methodology for identifying and sampling community-based youth smoking cessation programs. Eval Rev 2010; 34:35–51. [DOI] [PubMed] [Google Scholar]
- 12. Wangroongsarb P, Satimai W, Khamsiriwatchara A, et al. . Respondent-driven sampling on the Thailand-Cambodia border. II. Knowledge, perception, practice and treatment-seeking behaviour of migrants in malaria endemic zones. Malar J 2011; 10:117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Milligan P, Njie A, Bennett S. Comparison of two cluster sampling methods for health surveys in developing countries. Int J Epidemiol 2004; 33:469–76. [DOI] [PubMed] [Google Scholar]
- 14. Lister AJ, Scott CT. Use of space-filling curves to select sample locations in natural resource monitoring studies. Environ Monit Assess 2009; 149:71–80. [DOI] [PubMed] [Google Scholar]
- 15. Keating J, Macintyre K, Mbogo CM, Githure JI, Beier JC. Characterization of potential larval habitats for Anopheles mosquitoes in relation to urban land-use in Malindi, Kenya. Int J Health Geogr 2004; 3:9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Keating J, MacIntyre K, Mbogo C, et al. . A geographic sampling strategy for studying relationships between human activity and malaria vectors in urban Africa. Am J Trop Med Hyg 2003; 68:357–65. [PubMed] [Google Scholar]
- 17. Troyo A, Fuller DO, Calderón-Arguedas O, Beier JC. A geographical sampling method for surveys of mosquito larvae in an urban area using high-resolution satellite imagery. J Vector Ecol 2008; 33:1–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Gammino VM, Nuhu A, Chenoweth P, et al. . Using geographic information systems to track polio vaccination team performance: pilot project report. J Infect Dis 2014; 210(Suppl 1):S98–101. [DOI] [PubMed] [Google Scholar]
- 19. Moss WJ, Hamapumbu H, Kobayashi T, et al. . Use of remote sensing to identify spatial risk factors for malaria in a region of declining transmission: a cross-sectional and longitudinal community survey. Malar J 2011; 10:163. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Lowther SA, Curriero FC, Shields T, Ahmed S, Monze M, Moss WJ. Feasibility of satellite image-based sampling for a health survey among urban townships of Lusaka, Zambia. Trop Med Int Health 2009; 14:70–8. [DOI] [PubMed] [Google Scholar]
- 21. von Kalckreuth V, Konings F, Aaby P, et al. . The Typhoid Fever Surveillance in Africa Program (TSAP): clinical, diagnostic, and epidemiological methodologies. Clin Infect Dis 2016; 62(Suppl 1):S9–S16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Marks F, von Kalckreuth V, Aaby P, et al. . Incidence of invasive Salmonella disease in sub-Saharan Africa: a multicentre population-based surveillance study. Lancet Glob Health 2017; 5:e310–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Panzner U, Pak GD, Aaby P, et al. . Utilization of healthcare in the Typhoid Fever Surveillance in Africa Program. Clin Infect Dis 2016; 62(Suppl 1):S56–68. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. République du Sénégal. Ministère de la Santé, de l’Hygiène publique et de la Préention. Région Médicale de Dakar, District sanitaire de Pikine. Senegalese MoH, 2012. [Google Scholar]
- 25. République du Sénégal. Ministère de l’Economie et des Finances. Données démographiques du départment de Pikine. Senegalese Ministry of Economy and Finances, 2011. [Google Scholar]
- 26. République du Sénégal. Ministère de l’Economie et des Finances. Population de Pikine. Senegalese Ministry of Economy and Finances, 2002. [Google Scholar]
- 27. University of Gezira. Population studies center. Sudan: Department of Statistics, University of Gezira, 2008. [Google Scholar]
- 28. Central Bureau of Statistics. Southern Sudan Centre for Census Statistics and Evaluation (SSCCSE). 5th Sudan Population and Housing Census; Sudanese Central Bureau of Statistics, 2008. [Google Scholar]
- 29. Msunduzi Integrated Development Plan (IDP) for 2012–13 (South Africa). Pietermaritzburg Msunduzi, 2013. [Google Scholar]
- 30. Statistics South Africa. Census 2011 statistical release. Statistics South Africa (statssa), 2011. [Google Scholar]
- 31. République du Sénégal. Ministère de l’Urbanisme, de l’Habitat, de la Construction et de l’Hydraulique. Senegalese Ministry of Urbanization, Construction and Hydraulics. [Google Scholar]
- 32. Statistics South Africa. Census 2011 municipal report—KwaZulu-Natal. Statistics South Africa (statssa), 2011. [Google Scholar]
- 33. Centre de Suivi Ecologique: Banque de données urbaines. Interprétation de photographies aériennes, mission Haut-Mont-CSE complètement à partir de données existantes. Structures de santé dans la zone de Pikine-Guinaw Rail; 2004. [Google Scholar]
- 34. Bureau National du Recensement Région de Dakar. Département de Pikine.Les Equipements Urbains. Senegalese National Census Bureau of Dakar, 2004. [Google Scholar]
- 35. Ministry of Public Planning and Surveying Unit. Gezira State Sudan. Gezira State: Ministry of Public Health Planning and Surveying Unit of Sudan, 2012. [Google Scholar]
- 36. Department of Local Government and Traditional Affairs. KwaZulu-Natal Provincial Government. Development Information Services (GIS). The Development Information Services of the Department of Government and Traditional Affairs. 2009. [Google Scholar]
- 37. Municipal Demarcation Board. The Msunduzi (KZN225), publication of the delimitation of wards within Msunduzi Local Municipality. Pietermaritzburg Msunduzi Municipality, 2010. [Google Scholar]
- 38. Schipperijn J, Kerr J, Duncan S, Madsen T, Klinker CD, Troelsen J. Dynamic accuracy of GPS receivers for use in health research: a novel method to assess GPS accuracy in real-world settings. Front Public Health 2014; 2:21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Vazquez-Prokopec GM, Stoddard ST, Paz-Soldan V, et al. . Usefulness of commercially available GPS data-loggers for tracking human movement and exposure to dengue virus. Int J Health Geogr 2009; 8:68. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Lehmann LE. Nonparametrics: statistical methods based on ranks. 1st ed. Springer; 1975. [Google Scholar]
- 41. Siegel SCN. Nonparametric statistics for the behavioral sciences. 2nd ed. McGraw-Hill; 1988. [Google Scholar]
- 42. Thomson DR, Shitole S, Shitole T, et al. . A system for household enumeration and re-identification in densely populated slums to facilitate community research, education, and advocacy. PLoS One 2014; 9:e93925. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Verutes GM, Fiocco MB, Weeks JR, Coulter LL. Health, poverty, and place in Accra, Ghana: mapping neighborhoods. J Maps 2012; 8:369–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.