Abstract
Background
Sub-Saharan Africa bears a disproportionate burden of HIV infection. Knowledge of the spatial distribution of HIV outcomes is vital so that appropriate public health interventions can be directed at locations most in need. In this regard, spatial clustering analysis of HIV-related mortality events has not been performed in a rural sub-Saharan African setting.
Methodology and Results
Kulldorff’s spatial scan statistic was used to identify HIV-related and all-cause mortality clusters (p<0.05) in a population-based demographic surveillance survey in rural KwaZulu Natal, South Africa (2000–2006). The analysis was split pre (2000–2003) and post (2004–2006) rollout of antiretroviral therapy, respectively. Between 2000–2006 a total of 86,175 resident individuals ≥15 years of age were under surveillance and 5,875 deaths were recorded (of which 2,938 were HIV-related) over 343,060 person-years of observation (crude all-cause mortality rate 17.1/1000). During both time periods a cluster of high HIV-related (RR = 1.46/1.51, p = 0.001) and high all-cause mortality (RR = 1.35/1.38, p = 0.001) was identified in peri-urban communities near the National Road. A consistent low-risk cluster was detected in the urban township in both time periods (RR = 0.60/0.39, p = 0.003/0.005) and in the first time period (2000–2003) a large cluster of low HIV-related and all-cause mortality in a remote rural area was identified.
Conclusions
HIV-related and all-cause mortality exhibit strong spatial clustering tendencies in this population. Highest HIV-related mortality and all-cause mortality occurred in the peri-urban communities along the National Road and was lowest in the urban township and remote rural communities. The geography of HIV-related mortality corresponded closely to the geography of HIV prevalence, with the notable exception of the urban township where high HIV-related mortality would have been expected on the basis of the high HIV prevalence. Our results suggest that HIV treatment and care programmes should be strengthened in easy-to-reach high density, peri-urban populations near National Roads where both HIV-related and all-cause mortality are highest.
Introduction
Identification of areas with excess health problems is important so that appropriate public health interventions can be directed at these locations [1], [2]. Since many diseases are related to location, geographic information systems (GIS) through spatial analysis can identify problem areas or locations for public health effort especially in poor, remote or rural settings which have limited available resources. Evidence suggests that geographic approaches to control and prevention may enhance public health efforts [3] and improve public health care delivery systems. The development of technologies, such as GIS, and the advancement of spatial statistics have allowed the application of not only disease mapping but also spatial analyses, such as spatial clustering, in epidemiological research [4]–[7]. In this context, clusters are defined as a statistically significant excess or deficit of events relative to expectation [8].
There are a large number of studies that have used spatial clustering methods to identify clusters of deaths due to non-communicable diseases such as cancer in developed country contexts, for example, Rosenberg et al [4], and Jemal et al [7]. There are also examples of work that have used spatial clustering methods to help understand the epidemiology of infectious diseases in rural African settings. For example, in our study population, we previously demonstrated substantial geographical heterogeneity in the prevalence of HIV infection [9]. The highest prevalence occurred in peri-urban communities located near the National Road. Snow et al [10], report a space-time clustering of severe childhood morbidity on the Kenyan coast with seasonal peaks in incidence of severe malaria comprising discrete mini-epidemics. Similar studies on micro-epidemiology of malaria were done in Nouna, rural Burkina Faso [1] and the results are likely to help better understand the observed clustering of mortality in the area. However, for deaths related to infection with HIV, spatial clustering has not previously been done at a truly local level.
Here we use population-based data from KwaZulu-Natal, South Africa to detect statistically significant all-cause and HIV-related mortality clusters in a typical rural South African population between 2000 and 2006. In this setting, HIV with or without TB remains the major cause of death among the population [11] and residents are at a high-risk of HIV infection.
Methods
Study Area and Population
The study area is located near the market town of Mtubatuba in the Umkanyakude district of KwaZulu-Natal province, South Africa (Figure 1) and has been described in detail previously [12]. The population is almost exclusively Zulu-speaking and the study area was part of a former “homeland” under apartheid and is still characterized by high levels of circulatory migration [13]. The adult unemployment rate is 67% and the district of Umkhanyakude has the lowest score of any district in KwaZulu-Natal on United Nation’s Human Development Index [14]. The demographic surveillance site (DSS) was established in 2000 and since then has followed vital events (including all deaths) in a complete population of 87,000. The surveillance area is 438 km2, and is typical of many rural areas of South Africa in that while predominantly rural, it contains an urban township and informal peri-urban settlements [12]. All homesteads (N ≈ 12,000) within the Africa Centre’s demographic surveillance area have been mapped via global positioning system (GPS) technology to an accuracy of <2 m (Figure 2). The area is characterized by high HIV prevalence and HIV incidence [15]. Starting in 2004 there has been a rapid roll-out of anti-retroviral therapy (ART), [15], [16] through a nurse-led, devolved public-sector programme described previously [17].
The Africa Centre Demographic Information System (ACDIS)
ACDIS has been described in detail elsewhere [12]. Briefly, the ACDIS database stores longitudinal health and socio-demographic data on registered subjects, physical structures (e.g. homesteads, clinics and schools) and households [12]. The information is updated every six months through fieldworker visits. The events that are recorded include: individual events (death, birth, migration, etc) household events (household formation, migration, change of household head), and events affecting individual homesteads (start of a new building, change of a building’s main purpose or its owner).
All deaths across the surveillance area are subject to a verbal autopsy interview. The detailed description of this methodology has been described and validated previously [18]. In brief, trained nurses conduct interviews with the caregivers of the deceased. The interview includes an open disease history, a checklist of signs and symptoms, and a structured questionnaire [18]. Two most experienced clinicians then independently assign the cause of death. HIV and TB were combined as a cause of death as it is often difficult to distinguish between them using a verbal autopsy and this improves the sensitivity of the tool. The resulting ICD-10 (10th revision of the International Classification of Diseases) codes were grouped into global burden of disease groups I, II and III [19] with the exception of tuberculosis and AIDS diagnoses, which were classified together into a separate group as HIV-related deaths, given the extensive overlap in mortality from HIV infection and tuberculosis [20]. Ethical clearance for the demographic surveillance and collection of verbal autopsies has been obtained from the University of KwaZulu-Natal’s Ethics Committee.
Spatial Analysis
The mortality event and person-time data was used to describe the all-cause mortality and HIV-related mortality patterns for the adult (≥15 years of age) population. We aggregated mortality events and person-years of observation (PYO) by fieldworker area (48 in total across the study area). Individuals contributed to the person-time denominator from 1 January 2000, or from any later date at which they attained the age of 15, or through in-migration until 31 December 2006, and ceased to contribute to the denominator at death or out-migration. For this analysis we only used person-time accrued and deaths observed whilst the individual was resident at a homestead within the surveillance area. We then calculated directly standardized mortality rates for each fieldworker area to eliminate the influence of age and sex composition by standardizing against the ACDIS population for the period 2000–2006. We displayed this information in the form of thematic maps for both all-cause and HIV-related mortality from 2000–2003 (period before introduction of ART and 2004–2006 (period after introduction of ART), respectively.
Cluster Detection
Similarly, all mortality events and person-years of observation were summed for all individuals ≥15 years of age in each homestead (mapped to an accuracy of <2 m). We then applied Kulldorff’s spatial scan statistic (Poisson model) implemented in SaTScan software version 7.0 to perform the spatial analysis scanning to detect mortality clusters (high or low mortality clusters) across the surveillance area. A spatial scan statistic is a cluster detection test that is able to both detect the location of clusters and evaluate their statistical significance [21]. This was done by gradually scanning a window across time and/or space across the study area, noting the number of observed and expected observations inside the window at each location using a Poisson model [21], [22]. For any given position of the centre, the radius of the circle changes continuously so that it can take any value. For each circle, the spatial scan statistic calculates the likelihood of observing the observed number of cases inside and outside the circle [21], [23]. The circle with the maximum likelihood is the most likely cluster (least likely to have occurred by chance).
The spatial clustering method [24] offers several advantages: it corrects for multiple comparisons, adjusts for the heterogeneous population densities among the different areas in the study, detects and identifies the location of the clusters without prior specification of their suspected location or size thereby overcoming pre-selection bias, and the method allows for adjustment of covariates [22], [25]. The p-value of the statistic is obtained through Monte Carlo hypothesis testing (9999 iterations), where the null hypothesis of no cluster is rejected at an α level of 0.05 exactly if the simulated p-values is ≤0.05 for the most likely cluster [21], [23]. We limited any possible cluster so it would not exceed 50% of the total population at risk but in practice such a constraint is unlikely to have any bearing on the results given the highly localized spatial heterogeneity in HIV prevalence observed in this population [9]. We ran the analysis by two time periods described previously (2000–2003 and 2004–2006) for HIV-related and all-cause adult population scanning for clusters of high or low mortality rates.
Results
Over the seven year period (2000–2006) we observed a total of 86,175 resident individuals ≥15 years of age and 5,875 deaths were recorded (2,938 were HIV-related) over 343,060 PYO. The crude all-cause mortality rate in this population cohort was 17.1 (95% CI 16.7–17.6) deaths per 1000 PYO and was higher in males (18.1 deaths per 1000 PYO) in comparison to females (17.3 deaths per 1000 PYO). Across the two time periods, the crude mortality rate decreased slightly from 17.3 (95% CI 16.7–17.8) deaths per 1000 PYO between 2000–2003 to 17.0 (95% CI: 16.3–17.6) deaths per 1000 PYO between 2004–2006. HIV-related deaths accounted for 45% of deaths in males over 53% for females respectively (Figure 3). Although the HIV-related mortality rate was similar for males and females (≈8.6 per 1000 PYO), HIV-related deaths make up a larger proportion of deaths in females due to the fact that other causes of death such as injury-related deaths are higher in males [18], [26].
The results of cluster analysis for adult HIV-related mortality are shown in Figure 4. All results were adjusted for age and sex. Superimposed on the map of age-standardized HIV-related mortality rates are the clusters identified by the Kulldorff spatial scan statistic for the two time periods (A: 2000–2003 and B: 2004–2006). HIV-related mortality exhibits strong spatial clustering tendencies as measured by the Kulldorff’s spatial scan statistic for time periods 2000–2003 (A, cluster 1, RR = 1.46, p = 0.001, 836 observed mortality cases, 673.24 expected) and 2004–2006 (B, cluster 1, RR = 1.51, p = 0.001, 323 observed cases, 237.30 expected) respectively. A consistent low-risk cluster was detected around KwaMsane township across both time periods (A, clusters 2 and 3, RR = 0.60, 0.39, p = 0.014, 0.003, B, cluster 2, RR = 0.45, p = 0.005). The Highest HIV-related mortality occurred in the peri-urban communities along the National Road (Figure 5).
The results of cluster analysis for all-cause mortality are shown in Figure 6. Superimposed on the map of age-standardized all-cause mortality rates are the clusters identified by the Kulldorff spatial scan statistic for the two time periods (A: 2000–2003 and B: 2004–2006). Kulldorff’s spatial scan statistic has identified statistically significant clusters with high and low relative risk for both time periods. For time period 2000–2003 (A, cluster 1) a statistically significant cluster was detected with a high RR of 1.35 (p = 0.001): 1194 total observed cases (973.66 expected). Low relative risk clusters was also detected (A, clusters 2 and 3, RR = 0.31, 0.44, p = 0.041, 0.042). For time period 2004–2006 (B, cluster 1) a statistically significant cluster was detected with a high RR of 1.38 (p = 0.001): 660 total observed cases (518.07 expected). A low relative risk cluster was also detected for the period 2004–2006 (cluster 2) with a RR of 0.21, p = 0.017.
All-cause mortality clusters were located in similar locations as the HIV-related mortality clusters. This largely reflects the fact that HIV-related mortality is the major cause of death in this population and hence the spatial patterns of all-cause mortality are influenced by the underlying spatial distribution of HIV-related mortality.
Discussion
We have investigated for the first time localized spatial clustering of HIV-related and all-cause mortality events in adults (≥15 years of age) in a typical rural South African population with high HIV-prevalence. Both HIV-related and all-cause mortality are not homogenously (randomly) distributed in this population and exhibit strong spatial clustering tendencies as measured by the Kulldorff spatial scan statistic. Highest HIV-related and all-cause mortality occurred in the peri-urban communities along the National Road and were lowest in the urban township and remote rural communities. Overall, these results provide a clear rationale for the need to strengthen HIV treatment and care programmes in the high density peri-urban communities along the National Road.
Our results reinforce the substantial contribution of HIV to all-cause mortality in this population with half of all adult deaths over the study period being HIV-related. Our findings are supported by previous research in the same population which showed a reduction in HIV-related mortality post 2004 [26]. The high HIV mortality clusters identified in this study corresponded approximately with areas of high HIV prevalence identified previously in this population in communities closest to the National Road [9]. Indeed, a strong ecological relationship between proximity to roads and HIV prevalence among women attending antenatal clinics has been previously demonstrated in this setting [27]. Interestingly, the low mortality cluster around the urban township (close to the National Road) corresponded to a cluster of relatively high HIV prevalence. This could be a consequence of a combination of better living conditions, [28], [29] and easier access to HIV treatment and care in the urban area in comparison to the surrounding peri-urban communities. The locations of high and low mortality communities were remarkably similar across the two time periods. This reflects the fact that mortality rates in the peri-urban communities were always relatively higher than the surrounding population (and the converse was true for the urban and deep rural communities) despite the fact that overall mortality had decreased slightly across the two time periods. In line with our previous work [9], the results provide further evidence to challenge the paradigm of a ubiquitous ‘generalized’ rural epidemic. Rather the marked spatial variation in mortality in this population is likely a consequence of the fact that several localized HIV sub-epidemics occur in this population that are partly contained within geographically defined communities. Consequently, resources should not be distributed in a purely uniform manner in such settings where clear, marked geographical variations in mortality exist.
The last two decades have seen substantial work in the GIS arena on the statistical analysis of point patterns, e.g, Besag and Newell (1991) [3], and spatial clustering is one of the many statistical analysis used for point pattern analysis. Spatial clustering detection methods are classified as global, local and focused [3], [30]. In this present study we chose the local spatial scan statistic (Kulldorff’s spatial scan statistic) over others [5], [31], as it is suitable for the purpose of this present study because it determines the existence of statistically significant clusters and their geographic locations. Quite often public health authorities need to respond to demands to investigate potential clusters of different diseases and confirm or refute, with certainty, whether a health problem exists in a particular location [23]. The application of GIS and spatial scan statistics can help health workers better understand the HIV-related mortality patterns both in space and time.
A strength of our cluster detection approach is that we do not aggregate the data by arbitrary (with respect to the “boundary” of the cluster) administrative units but instead use person-exposure and mortality events at the level of individual homesteads (accuracy <2 m) to define the "clusters". A limitation of the analysis using the Kulldorff’s spatial scan statistic is that clusters are defined as circles [1], [10]. For example, if a homestead with a lower mortality rate is located next to or is surrounded by homesteads with a higher mortality rates it is more likely that the homestead with a lower mortality rate will always be included in the cluster [32], [33].
The results of this present study can be used to target high-risk communities for public health intervention and prioritize the areas that need thorough epidemiological investigation. Although the detailed datasets used in this analysis are unlikely to be available at a National level, the principal elucidated in our work suggests that HIV treatment programmes should be strengthened in easy-to-reach high density, peri-urban populations near National Roads where mortality rates are highest.
Acknowledgments
Our special thanks go to the management and staff of the Africa Centre for Health and Population Studies demographic surveillance site for using their data for the purpose of this research project. EN would like to thank the School of Public Health, Witwatersrand University, Johannesburg, South Africa, Papua New Guinea Institute of Medical Research and the Wosera demographic surveillance site for their support over the entire training programme. Last but not least, EN would like to thank FT for his contribution towards this paper.
Funding Statement
Elias Namosha was supported by the UNICEF/UNDP/World Bank/WHO Special Programme for Research and Training in Tropical Disease (TDR). Frank Tanser was supported by grant 1R01-HD058482-01 from the National Institute of Child Health and Human Development (NICHD). Funding for the Africa Centre's Demographic Surveillance Information System and Population-based HIV Survey was received from the Wellcome Trust, UK (grant # 082384/Z/07/Z)]. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Sankoh OA, Yazoume Y, Saueborn R, Muller O, Becher H (2001) Clustering of childhood mortality in rural Burkina Faso. International Journal of Epidemiology 30: 485–492. [DOI] [PubMed] [Google Scholar]
- 2. Fukuda Y, Umezaki M, Nakamura K, Takano T (2005) Variations in societal characteristics of spatial disease clusters: examples of colon, lung and breast cancer in Japan. International Journal of Health Geographics 4: 16 doi:10.1186/1476-072X [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Besag J, Newell J (1991) The detection of clusters in rare diseases. J Roy Stat Soc, Ser A 154: 143–155. [Google Scholar]
- 4. Rosenberg MS, Sokal RR, Oden NL, DiGiovann D (1999) Spatial autocorrelation of cancer in Western Europe. Eur J Epidemiol 15: 15–22. [DOI] [PubMed] [Google Scholar]
- 5. Kulldorff M, Feuer EJ, Miller BA, Freedman LS (1997) Breast cancer clusters in the northeast United States: a geographic analysis. Am J Epidemiol 146: 161–170. [DOI] [PubMed] [Google Scholar]
- 6. Green C, Hoppa RD, Young TK, Blanchard JF (2003) Geographic analysis of diabetes prevalence in an urban area. Soc Sci Med 57: 551–560. [DOI] [PubMed] [Google Scholar]
- 7. Jemal A, Kulldorff M, Devesa SS, Hayes RB, Fraumeni F (2002) A geographic analysis of prostate cancer mortality in the United States, 1970–89. Int J Cancer 101: 168–174. [DOI] [PubMed] [Google Scholar]
- 8. Jacky MJ, Frank CC, David C, Jonathan ME (2005) Geographic identification of high Gonorrhea transmission areas in Baltimore, Maryland. Am J Epidemiol 161: 73–80. [DOI] [PubMed] [Google Scholar]
- 9. Tanser F, Barnighausen T, Cooke GS, Newell M-L (2009) Localized spatial clustering of HIV infections in a widely disseminated rural South African epidemic. International Journal of Epidemiology 38: 1008–1016 doi:10.1093/ije/dyp148 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Snow R, Armstrong-Schellenberg JRM, Peshu N, Foster D, Newton CRJC, et al. (1993) Periodicity and space-time clustering of severe childhood malaria on the coast of Kenya. Trans R Soc Trop Med Hyg 87: 386–90. [DOI] [PubMed] [Google Scholar]
- 11.Muhwava W, Nyirenda M (2008) Demographic and Socio-Economic Trends in the ACDIS. Monograph Series No. 2, Africa Centre for Health and Population Studies, Somkhele, South Africa.
- 12. Tanser F, Hosegood V, Barnighausen T, Herbst K, Nyirenda M, et al. (2008) Cohort Profile: Africa Centre Demographic Information System (ACDIS) and population-based HIV survey. Int J Epidemiol 37: 956–62. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Lurie MN, Williams BG, Zuma K, Mkaya-Mwamburi D, Garnett G, Sturm AW, et al. (2003) The impact of migration on HIV-1 transmission in South Africa: a study of migrant and nonmigrant men and their partners. Sex Transm Dis 30(2): 149–56. [DOI] [PubMed] [Google Scholar]
- 14.Municipality UD (2011) Umkhanyakude district municipality integrated development plan (2012/2013–2016/17). Mkuze: Umkhanyakude district municipality.
- 15. Tanser F, Barnighausen T, Grapsa E, Zaidi J, Newell ML (2013) High coverage of ART associated with decline in risk of HIV acquisition in rural KwaZulu-Natal, South Africa. Science 2013 339(6122): 966–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Cooke GS, Tanser FC, Barnighausen TW, Newell ML (2010) Population uptake of antiretroviral treatment through primary care in rural South Africa. BMC Public Health 10: 585. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Houlihan CF, Bland RM, Mutevedzi PC, Lessells RJ, Ndirangu J, Thulare H, et al. (2011) Cohort Profile: Hlabisa HIV Treatment and Care Programme. International Journal of Epidemiology 40(2): 318–326. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Hosegood V, Vanneste A-M, Timaeus IM (2004) Levels and causes of adult mortality in rural South Africa. AIDS 18: 663–671. [DOI] [PubMed] [Google Scholar]
- 19.Murray CJ, Jamison DT, Lopez AD, Ezzati M, Mathers CD (2006) Global Burden of Disease and Risk Factors: Washington, DC: World Bank and Oxford University Press. [PubMed]
- 20. Gandhi NR, Moll A, Sturm AW, Pawinski R, Govender T, Lalloo U, et al. (2006) Extensively drug-resistant tuberculosis as a cause of death in patients co-infected with tuberculosis and HIV in a rural area of South Africa. The Lancet 368(9547): 1575–1580. [DOI] [PubMed] [Google Scholar]
- 21. Kulldorff M (1997) A spatial scan statistic. Communications in Statistics: Theory and Methods 26: 1481–1496. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Kulldorff M, Nagarwalla N (1995) Spatial disease clusters: detection and inference. Stat Med 14: 799–810. [DOI] [PubMed] [Google Scholar]
- 23. Agricola O, Wayne MS, Pascal M, Dean M, John H, Jeff W (2004) Investigation of clusters of giardiasis using GIS and Spatial scan statistic. International Journal of Health Geographics 3: 1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kulldorff M, Rand K, Gherman G, Williams G, DeFrancesco D (1998) Software for the spatial and space-time scan statistics. SaTScan & M. Version 2.1. National Cancer Institute Bethesda, USA.
- 25. Kulldorff M (2001) Prospective time periodic geographical disease surveillance using a scan statistic. J R Stat Soc Ser A 164: 61–72. [Google Scholar]
- 26. Herbst AJ, Cooke GS, Bärnighausen T, KanyKany A, Tanser F, Newell ML (2009) Adult mortality and antiretroviral treatment roll-out in rural KwaZulu-Natal, South Africa. Bull World Health Organ 87(10): 754–762. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Tanser F, LeSueur D, Solarsh G, Wilkinson D (2000) HIV heterogeneity and proximity of homestead to roads in rural South Africa: an exploration using a geographical information system. Tropical Medicine and International Health 5: 40–46. [DOI] [PubMed] [Google Scholar]
- 28.Hosegood V, Timaeus IM (2005) Household composition and dynamics in KwaZulu Natal, South Africa: mirroring social reality in longitudinal data collection. In: van der Walle E, editor. African Households: an exploration of census data. New York: Sharpe Inc; p.58–77.
- 29. Bärnighausen T, Hosegood V, Timaeus IM, Newell ML (2007) The socioeconomic determinants of HIV incidence: evidence from a longitudinal, population-based study in rural South Africa. Aids 21 Suppl 7S29–38. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Lawson AB (2001) Statistical Methods in Spatial Epidemiology. In: Probability and Statistics, John Wiley & Sons 298.
- 31. Kulldorff M, Athas WF, Feurer EJ (1998) Evaluating cluster alarms: a space-time scan statistic and brain cancer in Los Alamos, New Mexico. Am J Public Health 88: 1377–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Michaud J, Ellen JM, Johnson SM (2003) Responding to a community outbreak of syphilis by targeting sex partner meeting location: an example of a risk-space intervention. Transm Dis 30: 533–8. [DOI] [PubMed] [Google Scholar]
- 33.Wakefield J, Kelsall JE, Morris SE (2002) Clustering, cluster detection, and spatial variation in risk: In: Spatial epidemiology: methods and applications. New York: Oxford University Press: 128–52.