Abstract
Genomic sequencing has revolutionized our understanding of bacterial disease epidemiology, but remains underutilized for zoonotic pathogens in remote endemic settings. Anthrax, caused by the spore-forming bacterium Bacillus anthracis , remains a threat to human and animal health and rural livelihoods in low- and middle-income countries. While the global genomic diversity of B. anthracis has been well-characterized, there is limited information on how its populations are genetically structured at the scale at which transmission occurs, critical for understanding the pathogen’s evolution and transmission dynamics. Using a uniquely rich dataset, we quantified genome-wide SNPs among 73 B. anthracis isolates derived from 33 livestock carcasses sampled over 1 year throughout the Ngorongoro Conservation Area, Tanzania, a region hyperendemic for anthrax. Genome-wide SNPs distinguished 22 unique B. anthracis genotypes (i.e. SNP profiles) within the study area. However, phylogeographical structure was lacking, as identical SNP profiles were found throughout the study area, likely the result of the long and variable periods of spore dormancy and long-distance livestock movements. Significantly, divergent genotypes were obtained from spatio-temporally linked cases and even individual carcasses. The high number of SNPs distinguishing isolates from the same host is unlikely to have arisen during infection, as supported by our simulation models. This points to an unexpectedly wide transmission bottleneck for B. anthracis , with an inoculum comprising multiple variants being the norm. Our work highlights that inferring transmission patterns of B. anthracis from genomic data will require analytical approaches that account for extended and variable environmental persistence, as well as co-infection.
Keywords: Bacillus anthracis endemic anthrax, phylogenomics, simulation modelling, within-host diversity
Data Summary
Isolate Sequence Read Archive (SRA) accession numbers, along with metadata and sequence quality metrics, are available at the University of Glasgow’s data repository, Enlighten: http://dx.doi.org/10.5525/gla.researchdata.1217. Scripts used in this study can be accessed on GitHub at https://github.com/matejmedvecky/anthraxdiversityscripts and https://github.com/tristanpwdennis/anthrax_diversity.
Impact Statement.
Pathogens transmitted between animals and people affect the health and livelihoods of farmers, particularly in developing countries dependent on livestock. Understanding over what distances these pathogens are transmitted and how they evolve is important to inform control strategies towards reducing disease impacts. Information on the circulation of Bacillus anthracis , which causes the often-lethal disease anthrax, is lacking for settings where the disease is commonplace. Consequently, we examined its genetic variability in an area in Tanzania where anthrax is widespread, including cases occurring closely both in space and time, and within individual livestock carcasses. We found no clear link between how closely cases were sampled and their genetic similarity. We suspect this lack of congruence is primarily driven by large-scale livestock movements, which control efforts should take into consideration, and long periods during which infectious bacteria lie dormant. Another significant finding was the co-occurrence of multiple B. anthracis types within individual hosts, suggesting animals are commonly infected with a mixture of variants. This needs to be accounted for when investigating possible connections between cases.
Introduction
Genomic data have the potential to transform our understanding of the evolution and epidemiology of pathogens of public-health importance [1]. However, this potential has yet to be fully harnessed for many zoonotic diseases that occur in hard-to-reach areas. Anthrax remains endemic in many low- and middle-income countries (LMICs) worldwide [2]. It is a disease characterized by sudden deaths in herbivorous livestock and wildlife, and can also cause serious, potentially fatal disease in people [3]. Anthrax is classified among the neglected zoonoses: a group of diseases shared by animals and people that, due to their occurrence in remote, disadvantaged communities, collectively receive less than 0.1 % of international global health assistance [4]. As for many neglected zoonoses, there is little genomic data for Bacillus anthracis , the bacterium that causes anthrax, from endemic LMIC settings where surveillance tends to be limited. Such data could help to improve our understanding of transmission processes, such as how B. anthracis is spread within and between outbreaks, and ultimately contribute to more informed disease management.
The genomic diversity of B. anthracis has been well-described at a global scale. Isolates can be broadly divided into three major clades (A, B, C), of which the A clade is the most widespread and globally dominant [5–7]. Isolates from most B. anthracis lineages have been found across geographically widespread areas, often spanning multiple continents [7]. While particular variants often predominate regionally, high lineage diversity has also been reported, including co-circulation of strains from multiple linages [8, 9]. How B. anthracis diversity is structured at smaller scales is less well defined. The pathogen has limited genomic diversity compared to other bacterial species (i.e. is genetically monomorphic), rendering standard genotyping methods such as multilocus sequence typing insufficiently discriminatory [10]. Therefore, a hierarchical genotyping scheme known as PHRANA was developed specifically for B. anthracis , based on quickly evolving repetitive regions nested within more phylogenetically stable markers – canSNPs – that distinguish among the major lineages [5, 11]. Variants of this scheme have been used to examine the diversity of B. anthracis in several endemic settings globally, including in a few African countries [8, 12–14]. Genome-wide SNP data would offer higher resolution for discriminating among closely related isolates. However, whole-genome sequencing (WGS) features in only a few studies of local B. anthracis diversity [15–18], and has rarely been conducted outside Europe [19], so the potential for phylogenomic data to be used to understand transmission patterns within hyperendemic areas has yet to be explored.
Transmission of B. anthracis occurs primarily through the environment. After causing the death of the animal host, vegetative bacteria are released into the environment via bodily fluids. Here, upon exposure to oxygen and cues related to a lack of nutrients, these bacteria sporulate and can persist in a dormant yet infectious state for several decades [3, 20]. While the viability of spores decreases over time, new B. anthracis infections could theoretically arise from recent cases or cases that occurred several years or even decades previously. How this environmental persistence shapes the spatio-temporal diversity of B. anthracis in endemic settings has never been investigated.
In molecular epidemiological studies of bacterial pathogens, a single isolate is typically sequenced from each individual case. However, this approach fails to recognize the bacterial population diversity that may exist within the host [21, 22]. Such diversity can either result from mutations that arise during infection, or from heterogeneity (multiple variants) in the inoculum, either through co-infection (exposure to multiple variants simultaneously) or superinfection (multiple exposures) [23, 24]. Under those scenarios, a single isolate is unlikely to represent the overall diversity of the pathogen within the host, and as a result this approach can lead to erroneous inferences about transmission pathways [25]. The importance of capturing within-host diversity is, therefore, increasingly recognized [26, 27]. In the case of anthrax, multiple genotypes of B. anthracis have been previously found within individual hosts [28–30], but it remains unclear whether this represents a more widespread phenomenon.
The objective of this study was to quantify the genomic diversity of B. anthracis at hierarchical spatial scales within the livestock population of a hyperendemic setting. This was accomplished using a unique dataset including: (i) isolates collected throughout a large (~8300 km2) area of northern Tanzania where anthrax is widespread; (ii) among spatio-temporally linked cases; and (iii) within individual hosts, assessing multiple isolates from the same and different sample types associated with a case (e.g. tissue, blood, soil).
Methods
Study area
This study was conducted in the Ngorongoro Conservation Area (NCA), part of the Serengeti ecosystem in northern Tanzania, which covers 8292 km2. This multiple-land-use area is inhabited by roughly 87 000 people [31] and one million livestock (sheep, goats and cattle) (C. C. Chubwa,Veterinary Officer for Ngorongoro District, personal communication). Northern Tanzania remains hyperendemic for anthrax [32], and prior to this study the NCA was recognized as a potential hotspot for this disease [33].
Sample collection
Samples were collected between May 2016 and April 2017 inclusive through active surveillance by a dedicated field team. Sudden deaths in animals reported by community members throughout the NCA were investigated and samples were collected when anthrax was suspected (File S1, available with the online version of this article). When available, the following samples were collected: a piece of skin tissue (tip of the ear if the carcass was still intact, or a piece of hide if the carcass had already been opened); whole blood; swab of blood or body fluid at natural orifices; blood or body-fluid soaked soil from below the carcass; and insects found on or around the carcass. Various metadata were recorded, including the species of animal affected and the location of sampling (File S1). All samples were stored at ambient temperature for up to 6 months at local veterinary facilities until transport to the Kilimanjaro Clinical Research Institute (KCRI) in Moshi, Tanzania, for molecular diagnostics, as previously described [34], with aliquots shipped to the University of Pretoria, South Africa, for selective culture and DNA extraction from B. anthracis isolates.
Selective culture, DNA extraction and sequencing
Sample pre-treatment (i.e. to inhibit competition from heat-sensitive bacteria) is described in File S1. Sample homogenates (100 µl) were plated onto both polymyxin-EDTA thallous acetate (PET) selective medium and 5 % sheep blood agar (SBA). These were incubated at 37 °C overnight and the plates inspected for growth after 15–24 h incubation. The PET medium was then further incubated and inspected at 48 h. Suspect B. anthracis colonies based on typical morphological characteristics were sub-cultured onto SBA for purification and identification (File S1). In parallel, a single colony was streaked onto a new purity plate for nucleic acid extraction. In some instances, multiple isolates were selected from the same sample where the colonies demonstrated differences in morphology but were identified on the same plate and met the selection criteria (File S1).
DNA extracts from 75 isolates from 33 carcasses were submitted for library preparation and sequencing at MicrobesNG (Birmingham, UK). Libraries were prepared using the Nextera XT v2 kit (Illumina) and sequenced on the Illumina HiSeq platform, generating 250 bp paired-end reads.
Bioinformatics and genomic analyses
Reads were adapter trimmed by MicrobesNG using Trimmomatic v0.30 [35], and basic statistics determined using quast [36]. Bacterial species identification was confirmed using Kraken [37]. Based on these quality metrics, sequences from two isolates were excluded from further analyses: one due to a low number of reads (<40 000), and another that was identified as Bacillus cereus . There were some further indications that not all cultures were pure B. anthracis , despite multiple rounds of sub-culture (File S1). A reference-based mapping approach and strict variant filtering criteria were implemented to minimize the issues associated with the sequence quality, while making use of as much of the data as possible.
Read mapping and variant calling were performed on the CLIMB computing platform for microbial genomics [38]. Trimmed reads were aligned to the chromosome of the Ames Ancestor reference genome (accession no. NC_007530) using bwa-mem (version 0.7.17). Picard was used to mark and remove duplicate reads, add read group information, and index the bam files [39]. Quality metrics for read mapping were obtained using Qualimap [40]. Mapped reads had a mean depth of coverage of 85× across isolates, ranging from 24× to 245× (median 72×). SNPs were detected in individual isolates by VarScan v2.4.4 [41] with parameters set as follows: minimum read depth of 4; minimum base quality of 20; variant allele frequency ≥0.95. Subsequent SNP curation steps are described in File S1. Custom Python scripts for the assessment of read mapping SNP metrics data, variant site filtering and generation of variant call and alignment files (source codes with description of their functionality and usage), along with the final variant call and multiple sequence alignment files, are available on GitHub (https://github.com/matejmedvecky/anthraxdiversityscripts).
The alignment of concatenated SNPs was analysed using ModelFinder to determine the most appropriate model of nucleotide substitution [42]. Subsequently, a maximum-likelihood phylogeny was estimated in iq-tree [43] under the Kimura-3-parameters (K3P) model, using 1000 ultrafast bootstrap replicates [44]. A distance matrix detailing SNP differences between isolates was constructed using snp-dists v0.6 (https://github.com/tseemann/snp-dists). The distance between GPS points was calculated using the pointDistance command in the R package raster [45]. Isolation by distance was tested using Mantel test to assess the correlation between SNP distance and Euclidean geographical distance within the R package adegenet [46]. All program versions and commands used, the distance matrix, as well as small custom scripts are available on GitHub (https://github.com/tristanpwdennis/anthrax_diversity).
To place the newly sequenced isolates within the global phylogeny of B. anthracis , 80 WGS data sets from GenBank were accessed (Table S1), and a core-genome alignment generated using Parsnp v1.1.2 [47]. The resulting phylogeny indicated that NCA isolates belong to the ancient A lineage. To further resolve the diversity among NCA isolates compared with other publicly available isolates from the same lineage, reads from eight additional isolates available from the Sequence Read Archive (SRA) from a study by Bruce et al. [7] were accessed using Fastq-dump from the SRA-toolkit: all isolates (n=4) belonging to the 3.2 linage, and two arbitrarily selected isolates from each of the 3.1 and 3.3 lineages (Table S2). These were run through our SNP-calling pipeline as described above, resulting in a sequence alignment file free of -/N characters, which was used to infer a phylogeny in RAxML v8.2.11 [48] using a GTR (general time reversible) model of nucleotide substitution, and using the Ames Ancestor reference genome as an outgroup.
Assessment of B. anthracis diversity at hierarchical spatial scales
The diversity of B. anthracis was assessed at multiple hierarchical spatial scales (Fig. 1). Carcasses for which detailed sampling location data were available (n=32) could be grouped into four distinct areas within the NCA (central, north, south, east), referred to herein as ‘geographical groups’. To assess the genomic relatedness among spatio-temporally linked cases, on four occasions, samples were collected from two carcasses either from the same or neighbouring households on the same or consecutive days, which we refer to as ‘epidemiological clusters’. Three of these clusters were in the central geographical group, while the fourth was in the southern group of carcasses sampled. Finally, multiple isolates (n=2–4) were sequenced from a single carcass for 21 carcasses. These were either (i) isolates from multiple sample types (i.e. tissue, blood, swabs, soil and/or insects) from a single carcass (n=16 carcasses) and/or (ii) multiple isolates from the same sample (n=15 carcasses).
Simulation modelling
To assess the likelihood of different levels of within-host diversity arising during the course of infection, we performed simulation modelling of B. anthracis infection with a homogenous inoculum of varying size (1, 2, 5, 10, 20, 50 and 100 bacterial genomes) to reflect uncertainty in anthrax infectious dose, and across a range of replication cycles (up to 25 generations). In each bacterial generation, genomes underwent a round of replication followed by cell division and accordingly population size doubled in each generation. With an infectious dose d, discrete generations and no die-off, the population size in generation i is therefore . The number of mutations occurring in the replication of each genome was drawn from a Poisson distribution parameterized to reflect the estimated B. anthracis mutation rate (i.e. λ=0.0043). This genome-level mutation rate is based on the genome size of 5.2 million bp and a mutation rate of 8.3×10−10 mutations per site [29]; this represents an upper estimate of the mutation rate and was chosen as we were interested in estimating the upper limits on reasonable expectation of diversity emerging during the course of an infection initiated by a homogenous dose. Results were extrapolated to 40 generations, proposed to be the upper limit on the number of replication cycles during an infection [11]. Details of simulations run are provided in File S1. Pairs of genomes were repeatedly sampled from these simulated populations and the count of mutations separating each pair was calculated. Sampling was performed 100 times per generation for each simulation. Simulations, sampling of simulated populations, and linear models summarizing trends in the outcome of these processes were performed in R [49].
Results
B. anthracis within the NCA is limited to a single subgroup lacking a clear phylogeographical signal
WGS data were obtained from 73 B. anthracis isolates from a total of 33 carcass sites sampled throughout the NCA; all isolate metadata are available at http://dx.doi.org/10.5525/gla.researchdata.1217. Carcasses were of the following species: sheep (n=18), cattle (n=7), goats (n=4), donkey (n=1) or unknown host (n=3). All B. anthracis isolates from the NCA were found to belong to the ancient A subgroup of clade A (Fig. 2). Within this subgroup, the NCA isolates formed a monophyletic clade within cluster 3.2 as defined by Bruce et al. [7], which also contained the isolate A2075 (GenBank accession no. SRR2968187), isolated in 1999 from a baboon in Muhesi Game Reserve, central Tanzania.
A total of 125 SNPs that differed among the NCA isolates and A2075 were retained for analysis, of which 13 were unique to A2075. Twenty-two unique genotypes (i.e. SNP profiles) were found among the 73 NCA isolates (Fig. 3a–c). Based on a rarefaction analysis, the observed genotypic diversity was close to that present throughout the study area (i.e. further sampling would have been unlikely to reveal additional genotypes; Fig. S1). The maximum pairwise nucleotide difference between any two NCA-derived isolates was 49 SNPs [median=24, interquartile range (IQR)=10–35] (Fig. S2). There was no clear relationship between the pairwise nucleotide differences and the geographical distance between sampling locations (Fig. S3), as confirmed by a test for isolation by distance (r=0.04, P=0.214). Isolates from the central, eastern and southern geographical groups were observed throughout the phylogenetic tree, while all isolates from the northern sampling area were restricted to a single clade that contained the majority of NCA isolates (Fig. 3). In eight instances, identical B. anthracis genotypes were found in carcasses from different geographical groups, all of which involved isolates from the central geographical group and one of the other areas, with all areas implicated (Table S3). Identical or nearly identical SNP profiles (one SNP difference) were obtained from carcasses sampled 3–5 months apart on six occasions, and 10 months apart on one occasion (Table S4).
B. anthracis isolates from spatio-temporally linked cases rarely have identical SNP profiles
Between one and four isolates were sequenced from each carcass sampled as part of an epidemiological cluster (i.e. pair of spatio-temporally linked carcasses), resulting in 4–8 isolates per cluster. In most cases, isolates deriving from the same epidemiological cluster were phylogenetically unrelated. Only in one of the four clusters examined (C2) did both carcasses contain isolates with identical core SNP profiles (Fig. 3c). Overall, isolates from different carcasses within the same epidemiological cluster had similar numbers of SNP differences when compared with randomly selected carcasses (linked cases – median=21, IQR=0–33; unlinked cases – median=23, IQR=10–35) (Fig. 4).
Within-host diversity of B. anthracis is similar to between-host diversity in the NCA
Overall, the number of pairwise SNP differences between isolates from the same carcass was lower than that found between isolates from different carcasses (same – median=11, IQR=0–34; different – median=23, IQR=10–35) (Fig. 4). However, isolates with multiple distinct genotypes were obtained on 15 of 21 occasions wherein multiple isolates were sequenced from the same carcass, with as many as 43 SNP differences between isolates (Fig. 5). A high level of divergence was seen, regardless of whether isolates were from the same or different sample type (e.g. multiple isolates from a tissue sample versus isolates from tissue and soil samples; Fig. 5a). Divergent genotypes were observed in all carcasses from which three or more isolates were sequenced (12/12), and in a third (3/9) of carcasses for which two isolates were sequenced. A single SNP difference separated one isolate from two others within one carcass (AN16-83) and two SNP differences separated two isolates from a single soil sample (LNA). All other pairwise within-host SNP differences were eight or above. All of the within-host SNP profiles detected were shared with those from other carcasses.
Observed within-host diversity is unlikely to have arisen during the course of infection
Averaging across simulations with varying initial doses, in the 25th generation, 90.0 % of the population was identical to the infecting dose, 9.53 % differed by one SNP and 0.489 % by two SNPs, with higher numbers of mutations very rare (<0.01 %). While stochasticity was present in early generations and particularly at low inoculum doses (Figs S4–S6), the mean proportion of genomes with various numbers of SNPs changed predictably after the initial generations (Fig. 6a, b). Extrapolating to the 40th generation, then ~84 % of the population would be expected to be identical to the infecting dose,~15 % differing by one SNP and ~0.9 % by two SNPs.
To illustrate how this within-host generated diversity would be captured in our sampling, pairs of genomes were repeatedly sampled from simulated populations and pairwise differences calculated (Fig. 6c, d, File S1). Overall, pairwise differences greater than two SNPs occurred in less than 0.2 % of all simulated populations (Table S5). Higher SNP distances such as those observed (Fig. 5b) are, therefore, unlikely to arise within-host following infection with a homogenous inoculum.
Discussion
Genomic data for understanding the population structure and transmission patterns of bacterial zoonoses has been limited for LMIC settings where these diseases tend to have the greatest impact. Despite B. anthracis having limited genomic diversity in comparison with other bacterial species, WGS provided sufficient resolution to discriminate among isolates collected from within a relatively small geographical area of a few thousand square kilometres. The way in which the genomic diversity was partitioned across hierarchical spatial scales within this area has a series of novel implications for our understanding of how the pathogen is transmitted and evolves during endemic circulation.
B. anthracis diversity within the hyperendemic NCA region is limited to a single clade A sub-group
The NCA is a region where anthrax has likely been endemic for decades, if not centuries. Local community members claim that it has been an issue for their health and that of their livestock throughout living memory. Despite WGS providing sufficient discriminatory power to differentiate among individual B. anthracis isolates in this setting, fewer than 50 SNP differences were found among NCA isolates across the 5.2 Mb chromosomal genome, highlighting the degree to which this bacterial pathogen is monomorphic. All 73 sequenced isolates formed a monophyletic cluster within the ancient A lineage – also known as canSNP group A.Br.005/006 or A.Br.034 [6] – a subgroup of clade A comprised mostly of isolates from south-eastern Africa. This contrasts with some previous studies of anthrax diversity in chronically endemic areas reporting co-circulation of isolates from multiple lineages using lower-resolution markers [8, 9]. Our study, which only included isolates collected over 1 year, can be considered a snapshot of recent diversity only. That being said, Tanzanian isolates genotyped in a previous study (n=17) were also all found to belong to the ancient A clade [5], demonstrating that this particular lineage is dominant and well-established in this country. The most closely related publicly available isolate (A2075), sampled in central Tanzania approximately 300 km away from our study area, differed from some NCA isolates by only 13 SNPs, which illustrates that highly related B. anthracis isolates can be geographically widespread. Broader, longitudinal WGS studies of B. anthracis across different regions of Africa will be needed to assess the actual range of individual genotypes over space and time.
No phylogeographical signal observed despite considerable SNP diversity
Despite the considerable diversity observed (i.e. 22 unique genotypes within the sampled population), a phylogeographical signal was not detected at the scale of this study area. The finding of identical SNP profiles across distances of tens of kilometres likely reflects the ecology of anthrax in general and in our study system. First, there are few opportunities for genetic diversity to arise within the B. anthracis life cycle, which is characterized by long periods of environmental dormancy in spore form, punctuated by brief interludes of a few days where it develops into its vegetative state and replicates within an infected host. It is estimated that B. anthracis undergoes only 20–40 replications per infection [11]. Given its low mutation rate (5.2–8.3×10−10 mutations per site per generation) [5, 29, 50], novel mutations would likely arise in only a small proportion of B. anthracis infections, as also supported by our simulations. The short time of active replication within a host also means there is minimal opportunity for horizontal gene transfer with other bacteria [11], which further restricts the ability of B. anthracis to diversify. It is believed that B. anthracis rarely multiplies outside of a host, although there is some evidence for limited environmental replication by passing through the vegetative state [51, 52]. Viability of B. anthracis decays exponentially over time, although infectious spores remain detectable at carcass sites at least 4 years after the death of the animal [53], and under favourable conditions, spores can remain viable for up to several decades [20]. It is, therefore, reasonable to expect that spores from a single anthrax carcass with few or no novel mutations could be the source of subsequent infections over highly variable time periods, a phenomenon referred to by Sahl et al. [6] as a ‘time capsule’. This situation violates typical molecular clock assumptions [54], wherein SNPs would be expected to arise at a relatively steady rate within the pathogen population over time. The observation that contemporary isolates from the NCA are phylogenetically basal to isolate A2075, collected nearly two decades earlier, highlights this issue. In order for molecular clock models to be applied to environmentally persistent pathogens such as B. anthracis – for instance to estimate how long particular lineages have been in circulation – current analytical frameworks will need to be extended.
In addition to long and variable environmental persistence, animal movements and spatial admixture likely contribute to the lack of phylogeographical structure of B. anthracis within this study area. Potential sources of infection are quite spatially restricted, as the highest concentrations of viable spores are found within only a few metres of anthrax carcasses [55], in what have been termed ‘localized infectious zones’ [56]. However, given that the incubation period of anthrax in livestock typically ranges from 1 to 14 days and potentially longer [3], extensive movement can occur between the time of infection and the animal’s death, meaning the site of sampling likely does not reflect the site of infection. In parts of rural Africa where pastoralism is the main form of agriculture, livestock are moved for various reasons, including to access water, pastures and minerals. Such daily and/or seasonal livestock movements could contribute to the observation of identical SNP profiles over distances of tens of kilometres in our study area and the lack of relationship between genetic distance and geographical distance between sampling locations. While livestock appear to be the primary drivers of B. anthracis transmission in the NCA, movement of infected wildlife and scavengers acting as carriers of B. anthracis spores could represent an additional mechanism contributing to our observations [57, 58]. Further comparative genomic studies across wider areas will be essential for elucidating the geographical scales at which transmission occurs. This would help to delineate areas across which coordinated livestock vaccination campaigns should occur to avoid regular re-incursions through animal movement and trade.
High within-host diversity is the result of simultaneous infection with multiple variants, not within-host evolution
We observed high B. anthracis diversity within individual hosts, essentially indistinguishable from levels of diversity found throughout the study area. Smaller numbers of SNPs (1–2) could potentially have arisen during culture; however, based on previous passaging experiments with B. anthracis [29], we do not expect this to have contributed in our case due to the limited number of passages performed. Alternatively, small numbers of SNPs could have arisen during the course of infection within the host. The simulations we performed suggest that isolates with >2 SNP differences between them are unlikely to be the product of within-host evolution. The regular occurrence of such differences or greater among isolates from the same carcass indicates that animals are commonly infected with a heterogenous infectious dose (i.e. ingestion of a mixture of B. anthracis genotypes from single or possibly multiple grazing or watering points; Fig. 7). Multiple SNP profiles were present among isolates from the same individual soil samples, supporting the occurrence of heterogeneity in a single infection source. In principle, post-mortem contamination from the environment prior to sampling could have contributed to the diversity of profiles detected. However, the probability of this occurring is extremely low, given that detectable levels of spores tend to be spatially restricted close to previous carcass site locations [55]. Therefore, we believe that our finding of high within-host diversity cannot be explained by environmental contamination. It is also noteworthy that all within-host SNP profiles observed in this study were shared with isolates from at least one other sampled carcass (i.e. none were unique); this strongly suggests that most of the observed within-host diversity is the result of multiple genotypes having been present in the inoculum rather than having been generated de novo. Taken together, this points to a wide transmission bottleneck, since a small inoculum comprising only a few spores would limit the possible diversity that could be transmitted.
Our findings make an important contribution to the ongoing debate about the size of the transmission bottleneck for naturally occurring anthrax in animals (i.e. the number of spores that give rise to a case). Recent work has proposed that founding populations may be as small as 1–3 individual spores [30]. However, our findings are clearly at odds with such narrow bottlenecks, since the multiple genotypes we regularly observed within individual hosts could not have been transmitted within such a small inoculum. While our results provide no information about the exact size of the infectious dose, they align more closely with earlier suggestions of higher infectious doses, which are biologically plausible given that animals grazing at carcass sites might ingest hundreds of thousands of spores with each bite [53].
Spatio-temporally linked anthrax cases are rarely genetically linked
Limiting phylogenetic analyses to a single isolate per host often leads to incorrect inference of transmission events [22], including the potential to overlook important epidemiological connections [59]. This issue is exemplified in the current study by the spatio-temporally linked pair of cases in cluster 2: whereas several of the isolates from both carcasses (n=3 each) had identical SNP profiles, supporting a transmission link, both carcasses also harboured non-identical genotypes (Fig. 3). Under these circumstances, transmission links may be missed, even with complete sampling. As noted by Ågren et al. [29], in the case of multi-clonal infections, subsequent cases may stem from different genotypes from within the founding population, masking the fact that these cases stemmed from a common source, regardless of the number of isolates sequenced among the subsequent cases. This could be the case for the other three spatio-temporally linked pairs of anthrax carcasses investigated in this study (Fig. 3c), in which we only detected isolates with distinct SNP profiles. Alternatively, cases without a genetic link could be temporally linked for reasons other than exposure to a common source. For instance, animals might be more susceptible or at greater risk of exposure to infection at particular times of year, e.g. due to lower immune function related to their nutritional status and/or associated with weather extremes including prolonged rains or droughts [8]. More extensive sampling of within-carcass diversity would be necessary to investigate these hypotheses and to determine whether co-occurrence of genotypes could be used to track transmission patterns.
Conclusions
In this study, the genomic diversity of B. anthracis was quantified at various spatial scales within a hyperendemic setting. While WGS could discriminate among isolates within a relatively small geographical area, there was a lack of phylogeographical signal and limited genetic relatedness was observed among isolates from spatio-temporally linked cases. We hypothesize that this lack of spatial structure reflects the long-term persistence of B. anthracis spores in the environment, combined with extensive livestock movements related to local pastoralist practices. Based on simulations, the high within-host heterogeneity we observed points to an inoculum comprised of diverse genotypes, suggestive of a wide transmission bottleneck. Our work paves the way for studying B. anthracis genomic diversity and evolution within anthrax-endemic areas more broadly and to confirm the temporal and spatial scales over which genomic data are most informative for inferring transmission dynamics.
Supplementary Data
Funding information
T.L.F. was supported by a Marie Skłodowska-Curie Individual Fellowship (659223), a fellowship from the Natural Sciences and Engineering Research Council of Canada (PDF-471750-2015), the Wellcome Trust through a Springboard award to T.L. by the Academy of Medical Sciences (SBF002\1168), a Biotechnology and Biological Sciences Research Council (BBSRC) Discovery Fellowship (FORDE/BB/R012075/1), and a Lord Kelvin Adam Smith Leadership Fellow Fund from the University of Glasgow. O.R.A. was supported by a studentship from the Bill and Melinda Gates Foundation through the Program for Enhancing the Health and Productivity of Livestock (project reference ID 1083453). Establishment and maintenance of the Zoonoses Laboratory at the Kilimanjaro Clinical Research Institute (KCRI) (B.T.M.) was supported by the BBSRC (BB/J010367/1), BBSRC Zoonoses and Emerging Livestock Systems (BB/L017679, BB/L018926, BB/L018845), and the Wellcome Trust (096400/Z/11/Z). The funders had no role in study design, data collection and analysis, nor preparation of the manuscript.
Acknowledgements
We are grateful for all the support received for this research, particularly from the NCA community and authorities. We thank the Ngorongoro District Council, Ngorongoro Conservation Area Authority, District Veterinary Office, Tanzania Wildlife Research Institute and members of our field team – Sabore Ole Moko, Sironga Nanjicho, Kadogo Lerimba and Godwin Mshumba – for assistance with this study. We also thank the Directorate of Veterinary Services, Ministry of Agriculture, Livestock and Fisheries, and Ministry of Health, Community Development, Gender, Elderly and Children for their support. Finally, we thank Yi Xuan Chew and Nichith Kollanandi Ratheesh for initial assessments of the publicly available genomic data.
Author contributions
Conceptualization: T.L.F., T.L., R.B. Data curation: T.L.F., T.P.W.D. Formal analysis: T.L.F., T.P.W.D., W.T.H., M.M., A.V., R.B. Funding acquisition: T.L.F., B.T.M., T.L., R.B. Investigation: T.L.F., O.R.A., A.H., D.M., T.L. Methodology: T.L.F., T.P.W.D., W.T.H., M.M., R.B. Project administration: T.L.F., B.T.M., T.L. Resources: I.K., H.V.H. Software: T.P.W.D., W.T.H., M.M. Supervision: T.L., R.B. Visualization: T.L.F., T.P.W.D., W.T.H., M.M., A.V., R.B. Writing – original draft: T.L.F. Writing – review and editing: T.P.W.D., O.R.A., W.T.H., A.H., R.N.Z., T.L., R.B.
Conflicts of interest
The authors declare that there are no conflicts of interest.
Ethical statement
This study received ethical approval from the Kilimanjaro Christian Medical University College ethics review committee (certificate no. 2050); the National Institute for Medical Research, Tanzania (NIMR/HQ/R.8a/vol. IX/2660); the Tanzanian Commission for Science and Technology (2016-95-NA-2016-45); and the College of Medical Veterinary and Life Sciences ethics committee at the University of Glasgow (200150152). It also received permission under section 20 of the Animal Diseases Act 35 (1984) at the University of Pretoria, South Africa (reference 12/11/1/1/6). The Bill and Melinda Gates Foundation and the National Institute for Medical Research, Tanzania, approved submission of this manuscript.
Footnotes
Abbreviations: IQR, interquartile range; LMIC, low- and middle-income country; NCA, Ngorongoro Conservation Area; SRA, Sequence Read Archive; WGS, whole-genome sequencing.
All supporting data, code and protocols have been provided within the article or through supplementary data files. Five supplementary tables, nine supplementary figures and one supplementary file are available with the online version of this article.
References
- 1.Kao RR, Haydon DT, Lycett SJ, Murcia PR. Supersize me: how whole-genome sequencing and big data are transforming epidemiology. Trends Microbiol. 2014;22:282–291. doi: 10.1016/j.tim.2014.02.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Carlson CJ, Kracalik IT, Ross N, Alexander KA, Hugh-Jones ME, et al. The global distribution of Bacillus anthracis and associated anthrax risk to humans, livestock and wildlife. Nat Microbiol. 2019;4:1337–1343. doi: 10.1038/s41564-019-0435-4. [DOI] [PubMed] [Google Scholar]
- 3.WHO Anthrax in Humans and Animals. Geneva: World Health Organization; 2008. [PubMed] [Google Scholar]
- 4.WHO The Control of Neglected Zoonotic Diseases. Community-Based Interventions for Prevention and Control. Report of the Third Conference Organized with ICONZ, DFID-RIU, Gates Foundation, SOS, EU, TDR and FAO with the Participation of ILRI and OIE. Geneva: World Health Organization; 2011. [Google Scholar]
- 5.Van Ert MN, Easterday WR, Huynh LY, Okinaka RT, Hugh-Jones ME, et al. Global genetic population structure of Bacillus anthracis . PLoS One. 2007;2:e461. doi: 10.1371/journal.pone.0000461. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Sahl JW, Pearson T, Okinaka R, Schupp JM, Gillece JD, et al. A Bacillus anthracis genome sequence from the Sverdlovsk 1979 autopsy specimens. mBio. 2016;7:e01501-16. doi: 10.1128/mBio.01501-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Bruce SA, Schiraldi NJ, Kamath PL, Easterday WR, Turner WC. A classification framework for Bacillus anthracis defined by global genomic structure. Evol Appl. 2020;13:935–944. doi: 10.1111/eva.12911. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Smith KL, DeVos V, Bryden H, Price LB, Hugh-Jones ME, et al. Bacillus anthracis diversity in Kruger National Park. J Clin Microbiol. 2000;38:3780–3784. doi: 10.1128/JCM.38.10.3780-3784.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Simonson TS, Okinaka RT, Wang B, Easterday WR, Huynh L, et al. Bacillus anthracis in China and its relationship to worldwide lineages. BMC Microbiol. 2009;9:71. doi: 10.1186/1471-2180-9-71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Achtman M. Evolution, population structure, and phylogeography of genetically monomorphic bacterial pathogens. Annu Rev Microbiol. 2008;62:53–70. doi: 10.1146/annurev.micro.62.081307.162832. [DOI] [PubMed] [Google Scholar]
- 11.Keim P, Van Ert MN, Pearson T, Vogler AJ, Huynh LY, et al. Anthrax molecular epidemiology and forensics: using the appropriate marker for different evolutionary scales. Infect Genet Evol. 2004;4:205–213. doi: 10.1016/j.meegid.2004.02.005. [DOI] [PubMed] [Google Scholar]
- 12.Fasanella A, Serrecchia L, Chiaverini A, Garofolo G, Muuka GM, et al. Use of Canonical Single Nucleotide Polymorphism (CanSNPs) to characterize Bacillus anthracis outbreak strains in Zambia between 1990 and 2014. PeerJ. 2018;6:e5270. doi: 10.7717/peerj.5270. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Beyer W, Bellan S, Eberle G, Ganz HH, Getz WM, et al. Distribution and molecular evolution of Bacillus anthracis genotypes in Namibia. PLoS Negl Trop Dis. 2012;6:e1534. doi: 10.1371/journal.pntd.0001534. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Maho A, Rossano A, Hächler H, Holzer A, Schelling E, et al. Antibiotic susceptibility and molecular diversity of Bacillus anthracis strains in Chad: detection of a new phylogenetic subgroup. J Clin Microbiol. 2006;44:3422–3425. doi: 10.1128/JCM.01269-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Girault G, Blouin Y, Vergnaud G, Derzelle S. High-throughput sequencing of Bacillus anthracis in France: investigating genome diversity and population structure using whole-genome SNP discovery. BMC Genomics. 2014;15:288. doi: 10.1186/1471-2164-15-288. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Derzelle S, Girault G, Roest HIJ, Koene M. Molecular diversity of Bacillus anthracis in the Netherlands: investigating the relationship to the worldwide population using whole-genome SNP discovery. Infect Genet Evol. 2015;32:370–376. doi: 10.1016/j.meegid.2015.03.030. [DOI] [PubMed] [Google Scholar]
- 17.Derzelle S, Girault G, Kokotovic B, Angen Ø. Whole genome-sequencing and phylogenetic analysis of a historical collection of Bacillus anthracis strains from Danish cattle. PLoS One. 2015;10:e0134699. doi: 10.1371/journal.pone.0134699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Lienemann T, Beyer W, Pelkola K, Rossow H, Rehn A, et al. Genotyping and phylogenetic placement of Bacillus anthracis isolates from Finland, a country with rare anthrax cases. BMC Microbiol. 2018;18:102. doi: 10.1186/s12866-018-1250-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Lekota KE, Hassim A, Madoroba E, Hefer CA, van Heerden H. Phylogenomic structure of Bacillus anthracis isolates in the Northern Cape Province, South Africa revealed novel single nucleotide polymorphisms. Infect Genet Evol. 2020;80:104146. doi: 10.1016/j.meegid.2019.104146. [DOI] [PubMed] [Google Scholar]
- 20.Carlson CJ, Getz WM, Kausrud KL, Cizauskas CA, Blackburn JK, et al. Spores and soil from six sides: interdisciplinarity and the environmental biology of anthrax (Bacillus anthracis) Biol Rev Camb Philos Soc. 2018;93:1813–1831. doi: 10.1111/brv.12420. [DOI] [PubMed] [Google Scholar]
- 21.Döpfer D, Buist W, Soyer Y, Munoz MA, Zadoks RN, et al. Assessing genetic heterogeneity within bacterial species isolated from gastrointestinal and environmental samples: how many isolates does it take? Appl Environ Microbiol. 2008;74:3490–3496. doi: 10.1128/AEM.02789-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Worby CJ, Lipsitch M, Hanage WP. Within-host bacterial diversity hinders accurate reconstruction of transmission networks from genomic distance data. PLoS Comput Biol. 2014;10:e1003549. doi: 10.1371/journal.pcbi.1003549. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Futse JE, Brayton KA, Dark MJ, Knowles DP, Palmer GH. Superinfection as a driver of genomic diversification in antigenically variant pathogens. Proc Natl Acad Sci USA. 2008;105:2123–2127. doi: 10.1073/pnas.0710333105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Sintchenko V, Holmes EC. The role of pathogen genomics in assessing disease transmission. BMJ. 2015;350:h1314. doi: 10.1136/bmj.h1314. [DOI] [PubMed] [Google Scholar]
- 25.Didelot X, Walker AS, Peto TE, Crook DW, Wilson DJ. Within-host evolution of bacterial pathogens. Nat Rev Microbiol. 2016;14:150–162. doi: 10.1038/nrmicro.2015.13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Paterson GK, Harrison EM, Murray GGR, Welch JJ, Warland JH, et al. Capturing the cloud of diversity reveals complexity and heterogeneity of MRSA carriage, infection and transmission. Nat Commun. 2015;6:6560. doi: 10.1038/ncomms7560. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Alamil M, Hughes J, Berthier K, Desbiez C, Thébaud G, et al. Inferring epidemiological links from deep sequencing data: a statistical learning approach for human, animal and plant diseases. Philos Trans R Soc Lond B Biol Sci. 2019;374:20180258. doi: 10.1098/rstb.2018.0258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Beyer W, Turnbull PCB. Co-infection of an animal with more than one genotype can occur in anthrax. Lett Appl Microbiol. 2013;57:380–384. doi: 10.1111/lam.12140. [DOI] [PubMed] [Google Scholar]
- 29.Ågren J, Finn M, Bengtsson B, Segerman B. Microevolution during an anthrax outbreak leading to clonal heterogeneity and penicillin resistance. PLoS One. 2014;9:e89112. doi: 10.1371/journal.pone.0089112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Easterday WR, Ponciano JM, Gomez JP, Van Ert MN, Hadfield T, et al. Coalescence modeling of intrainfection Bacillus anthracis populations allows estimation of infection parameters in wild populations. Proc Natl Acad Sci USA. 2020;117:4273–4280. doi: 10.1073/pnas.1920790117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Tanzania National Bureau of Statistics, Office of Chief Government Statistician Population and Housing Census. Dodoma: Tanzania National Bureau of Statistics; 2012. [Google Scholar]
- 32.Mwakapeje ER, Høgset S, Fyumagwa R, Nonga HE, Mdegela RH, et al. Anthrax outbreaks in the humans – livestock and wildlife interface areas of Northern Tanzania: a retrospective record review 2006–2016. BMC Public Health. 2018;18:106. doi: 10.1186/s12889-017-5007-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Lembo T, Hampson K, Auty H, Beesley CA, Bessell P, et al. Serologic surveillance of anthrax in the Serengeti ecosystem, Tanzania, 1996–2009. Emerg Infect Dis. 2011;17:387–394. doi: 10.3201/eid1703.101290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Aminu OR, Lembo T, Zadoks RN, Biek R, Lewis S, et al. Practical and effective diagnosis of animal anthrax in endemic low-resource settings. PLoS Negl Trop Dis. 2020;14:e0008655. doi: 10.1371/journal.pntd.0008655. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Gurevich A, Saveliev V, Vyahhi N, Tesler G. QUAST: quality assessment tool for genome assemblies. Bioinformatics. 2013;29:1072–1075. doi: 10.1093/bioinformatics/btt086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Wood DE, Salzberg SL. Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol. 2014;15:R46. doi: 10.1186/gb-2014-15-3-r46. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Connor TR, Loman NJ, Thompson S, Smith A, Southgate J, et al. CLIMB (the Cloud Infrastructure for Microbial Bioinformatics): an online resource for the medical microbiology community. Microb Genom. 2016;2:e000086. doi: 10.1099/mgen.0.000086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Broad Institute Picard toolkit. 2019. http://broadinstitute.github.io/picard
- 40.García-Alcalde F, Okonechnikov K, Carbonell J, Cruz LM, Götz S, et al. Qualimap: evaluating next-generation sequencing alignment data. Bioinformatics. 2012;28:2678–2679. doi: 10.1093/bioinformatics/bts503. [DOI] [PubMed] [Google Scholar]
- 41.Koboldt DC, Zhang Q, Larson DE, Shen D, McLellan MD, et al. VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing. Genome Res. 2012;22:568–576. doi: 10.1101/gr.129684.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods. 2017;14:587–589. doi: 10.1038/nmeth.4285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Nguyen L-T, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015;32:268–274. doi: 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Minh BQ, Nguyen MAT, von Haeseler A. Ultrafast approximation for phylogenetic bootstrap. Mol Biol Evol. 2013;30:1188–1195. doi: 10.1093/molbev/mst024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Hijmans RJ. Raster: Geographic Data Analysis and Modeling Version 3.0-12 from CRAN ( https://rdrr.io/cran/raster) 2020.
- 46.Jombart T. adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics. 2008;24:1403–1405. doi: 10.1093/bioinformatics/btn129. [DOI] [PubMed] [Google Scholar]
- 47.Treangen TJ, Ondov BD, Koren S, Phillippy AM. The Harvest suite for rapid core-genome alignment and visualization of thousands of intraspecific microbial genomes. Genome Biol. 2014;15:524. doi: 10.1186/s13059-014-0524-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–1313. doi: 10.1093/bioinformatics/btu033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.R Core Team Vienna: R Foundation for Statistical Computing; 2018. R: a Language and Environment for Statistical Computing.https://www.R-project.org/ [Google Scholar]
- 50.Vogler AJ, Busch JD, Percy-Fine S, Tipton-Hunton C, Smith KL, et al. Molecular analysis of rifampin resistance in Bacillus anthracis and Bacillus cereus . Antimicrob Agents Chemother. 2002;46:511–513. doi: 10.1128/AAC.46.2.511-513.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Saile E, Koehler TM. Bacillus anthracis multiplication, persistence, and genetic exchange in the rhizosphere of grass plants. Appl Environ Microbiol. 2006;72:3168–3174. doi: 10.1128/AEM.72.5.3168-3174.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Braun P, Grass G, Aceti A, Serrecchia L, Affuso A, et al. Microevolution of Anthrax from a Young Ancestor (M.A.Y.A.) suggests a soil-borne life cycle of Bacillus anthracis . PLoS One. 2015;10:e0135346. doi: 10.1371/journal.pone.0135346. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Turner WC, Kausrud KL, Beyer W, Easterday WR, Barandongo ZR, et al. Lethal exposure: an integrated approach to pathogen transmission via environmental reservoirs. Sci Rep. 2016;6:27311. doi: 10.1038/srep27311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Biek R, Pybus OG, Lloyd-Smith JO, Didelot X. Measurably evolving pathogens in the genomic era. Trends Ecol Evol. 2015;30:306–313. doi: 10.1016/j.tree.2015.03.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Dragon DC, Bader DE, Mitchell J, Woollen N. Natural dissemination of Bacillus anthracis spores in northern Canada. Appl Environ Microbiol. 2005;71:1610–1615. doi: 10.1128/AEM.71.3.1610-1615.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Blackburn JK, Ganz HH, Ponciano JM, Turner WC, Ryan SJ, et al. Modeling R₀ for pathogens with environmental transmission: animal movements, pathogen populations, and local infectious zones. Int J Environ Res Public Health. 2019;16:954. doi: 10.3390/ijerph16060954. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.De Vos V. Vultures as carriers of anthrax. J S Afr Vet Assoc. 1979;50:35. [PubMed] [Google Scholar]
- 58.Stears K, Schmitt MH, Turner WC, McCauley DJ, Muse EA, et al. Hippopotamus movements structure the spatiotemporal dynamics of an active anthrax outbreak. Ecosphere. 2021;12:e03540. doi: 10.1002/ecs2.3540. [DOI] [Google Scholar]
- 59.Forde TL, Orsel K, Zadoks RN, Biek R, Adams LG, et al. Bacterial genomics reveal the complex epidemiology of an emerging pathogen in arctic and boreal ungulates. Front Microbiol. 2016;7:1759. doi: 10.3389/fmicb.2016.01759. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.UNEP-WCMC Protected Area Profile for Ngorongoro Conservation Area. 2020. www.protectedplanet.net
- 61.Wickham H. ggplot2: Elegant Graphics for Data Analysis. New York: Springer-Verlag; 2009. [Google Scholar]
- 62.Pebesma E. Simple features for R: standardized support for spatial vector data. R Journal. 2018;10:439–446. doi: 10.32614/RJ-2018-009. [DOI] [Google Scholar]
- 63.Yu G, Smith DK, Zhu H, Guan Y, Lam T-Y. Ggtree: an r package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods Ecol Evol. 2017;8:28–36. [Google Scholar]
- 64.Ciccarelli FD, Doerks T, von Mering C, Creevey CJ, Snel B, et al. Toward automatic reconstruction of a highly resolved tree of life. Science. 2006;311:1283–1287. doi: 10.1126/science.1123061. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.