Abstract
Emerging infections caused by fungi have become a widely recognized global phenomenon and are causing an increasing burden of disease. Genomic techniques are providing new insights into the structure of fungal populations, revealing hitherto undescribed fine‐scale adaptations to environments and hosts that govern their emergence as infections. Cryptococcal meningitis is a neglected tropical disease that is responsible for a large proportion of AIDS‐related deaths across Africa; however, the ecological determinants that underlie a patient's risk of infection remain largely unexplored. Here, we use genome sequencing and ecological genomics to decipher the evolutionary ecology of the aetiological agents of cryptococcal meningitis, Cryptococcus neoformans and Cryptococcus gattii, across the central African country of Zambia. We show that the occurrence of these two pathogens is differentially associated with biotic (macroecological) and abiotic (physical) factors across two key African ecoregions, Central Miombo woodlands and Zambezi Mopane woodlands. We show that speciation of Cryptococcus has resulted in adaptation to occupy different ecological niches, with C. neoformans found to occupy Zambezi Mopane woodlands and C. gattii primarily recovered from Central Miombo woodlands. Genome sequencing shows that C. neoformans causes 95% of human infections in this region, of which over three‐quarters belonged to the globalized lineage VNI. We show that VNI infections are largely associated with urbanized populations in Zambia. Conversely, the majority of C. neoformans isolates recovered in the environment belong to the genetically diverse African‐endemic lineage VNB, and we show hitherto unmapped levels of genomic diversity within this lineage. Our results reveal the complex evolutionary ecology that underpins the reservoirs of infection for this, and likely other, deadly pathogenic fungi.
Keywords: Cryptococcus neoformans, ecological genetics, fungi, microbial ecology, niche modelling
Introduction
Pathogenic fungi are widely responsible for emerging infections in humans as well as plants and animals by expanding their host range, increasing their virulence or through invading novel environments (Giraud et al. 2010; Fisher et al. 2012). Understanding the adaptation of fungal pathogens to ecological niches and the hosts therein is central to understanding how fungi emerge as pathogens, thereby explaining destructive processes such as the global panzootics and pandemics that they cause (Fisher et al. 2012).
Two closely related species of pathogenic basidiomycete yeasts, Cryptococcus neoformans and Cryptococcus gattii, are the aetiologic agent of the cryptococcal disease in humans, and have emerged as significant pathogens both in time and space (Casadevall et al. 2003; Nielsen et al. 2007). Despite the increased availability of antiretroviral therapy, cryptococcal meningitis remains a neglected disease that is responsible for a large proportion of AIDS‐related deaths (Van Wyk et al. 2014), reaching 70% in healthcare‐deprived areas of sub‐Saharan Africa (Desalermos et al. 2012). The majority of clinical infections are caused by C. neoformans which accounts for 99% of the cases found worldwide (Chayakulkeeree & Perfect 2008; Hagen et al. 2015). Although this is largely related to these fungi infecting an increasing pool of highly susceptible HIV/AIDS patients, spatial emergence is ongoing and C. gattii has been the focus of interest since the occurrence of the well‐documented outbreak and spread across the Pacific Northwest of Canada and the USA (Hoang et al. 2004; Kidd et al. 2004). In common with many other opportunistic pathogenic fungi, Cryptococcus is a saprophyte and is acquired from the environment; infected patients are not known to onwardly transmit the organism. For this reason, in order to understand the epidemiology of human cryptococcosis it is necessary to describe the ecology that supports the environmental reservoirs of this infection (Hiremath et al. 2008). Desnos‐Ollivier et al. (2010) described the ecological niches of C. neoformans as ‘extraordinarily complex’, due to their probable inclusion of other fungi, bacteria, protists or viruses. Increasingly, the role of ecological interactions within microbial communities and how these relationships can alter species distribution on large geographical scales are being addressed (Johnson & Stinchcombe 2007), and it is time that integrative approaches such as these are applied to the study of pathogenic fungi as well as the diseases that they cause.
Hardin (1960) formalized the principle of competitive exclusion; this states that two species which are competing for the same resources, if ecological factors remain constant, cannot coexist and maintain populations of similar size. The realized niche represents the actual niche where a given species is present, constrained by biotic and abiotic pressures. The fundamental niche is the complete range that a species can potentially occupy (Hutchinson 1959; Connell 1961). Current hypotheses accounting for the worldwide dissemination of C. neoformans state that the species gained the ability to metabolize pigeon guano, which became its realized niche. These adaptations may then have led to this pathogens worldwide spread via the use of birds and their migratory routes as vectors; in this way, the pathogen thus ‘explored’ and exploited its fundamental niche (Casadevall & Perfect 1998; Granados & Castañeda 2005). In contrast, C. gattii is thought to be more restricted to its realized niche, which is believed to be sedentary trees (Nielsen et al. 2007).
Genetic analysis to date has shown that infection by Cryptococcus in Africa is largely caused by C. neoformans and infections by C. gattii are rarely recovered (Wiesner et al. 2012; Chen et al. 2015) C. neoformans can be broadly divided into three major lineages with the first two, VNI and VNII, being globally distributed whilst a third, VNB, appears to be restricted to southern Africa. Across this region, studies have shown that VNB is largely associated with the Mopane woodland ecoregion while VNI and VNII have a tendency to be associated with urban areas, bird guano and non‐Mopane tree‐types (Chen et al. 2015). The genetic lineage of C. neoformans appears to play an important role in infection, with infections by VNB appearing more virulent (Beale et al. 2015). Together, these studies suggest that the different species and lineages of Cryptococcus in Africa inhabit markedly different ecological niches and that this heterogeneity leads to variation in not only risk of infection for HIV/AIDS patients, but also their risk of developing severe disease (Beale et al. 2015).
In Zambia, cryptococcal meningitis remains a serious issue as the country has one of the highest burdens of HIV in the world, with a prevalence estimated at over 13% (UNAIDS 2015). The landscape of the country has changed rapidly over the last decades owing to intense deforestation, changes in human demography and the development of industrial agriculture (Chidumayo 2013). Thus, Zambia provides a broad set of spatial scales and dynamic ecosystems within which to study the physical (abiotic) and macroecological (biotic) factors affecting the Cryptococcus species complex in its natural environment. Being an opportunistic environmental pathogen, Cryptococcus is not only under spatial and climatic pressures but also under the influence other fungal species or predators. Our study aims to recover and map the aetiological agent of human cryptococcosis from the environment, in a setting where the burden of cryptococcal meningitis is the highest worldwide (UNAIDS 2015). In this study, we isolated Cryptococcus from across the two main ecoregions that dominate Zambia, the soils and trees of the dry tropical Miombo forests and their counterparts across the Zambezi Mopane Woodlands. We model the ecological niches within which the two species occurred and, in concert, investigate the soil fungal community associated with either C. gattii or C. neoformans using an ITS2 metabarcoding approach to understand whether a particular microbial composition is associated with either pathogen. Finally, whole‐genome sequencing (WGS) was used to subsequently explore associations between environmental Cryptococcus neoformans genomes with those recovered from patients participating in an ongoing clinical trial in the capital city Lusaka in order to understand whether clinical isolates are associated with a particular genotype. Together, these data form a mosaic that reveals how physical and ecological factors structure the environmental diversity of Cryptococcus across the Zambian landscape, leading to infections in the people that find their home there.
Materials and methods
Environmental sampling
To determine the prevalence of Cryptococcus neoformans in Zambia, samples were collected during both the wet and dry seasons (n = 1356). Environmental sampling took place in January 2013 at the beginning of the rainy season, and a total of 583 samples were collected from soils, tree bark, pigeon guano and insects, substrates from which Cryptococcus species have been commonly isolated (Granados & Castañeda 2005; Litvintseva et al. 2005; Randhawa et al. 2005). The dry season was sampled in September 2013, and 773 samples were collected. Tree species were recorded, and tree tags were used with GPS coordinates in order to be able to return to each site. Two ecoregions were investigated throughout Zambia namely Central Miombo woodlands and the Zambezi and Mopane woodlands (White 1983). The Zambezi Mopane region is characterized by the tree flora being dominated by the Mopane tree (Colophospermum mopane). This species of tree was originally identified as potential reservoir of C. neoformans by Litvintseva et al. (2006) and covers 15% of Zambia, entering the country from the Botswana border. The Zambezian Mopane woodlands follow a low‐elevation area ranging from 200 to 600 m as opposed to the Central Miombo Woodlands which are contained between 800 and 1200 m. The Zambezi Mopane Woodlands region is thought to have significant evolutionary implications for biodiversity as it represents a changeover zone between tropical and subtropical biomes (World Wildlife Fund 2011), and remains a centre for Zambia's National Game Parks, chiefly the Luangwa valley. Covering more than 50% of the country, the Central Miombo woodlands represent the most extensive ecoregion in Zambia. The dominant tree type is Brachystegia sp., but sporadic grassy wetlands (‘dambos’) can make up to 30% of this ecoregion. The population density in the region is relatively low principally owing to the low‐productivity agriculture caused by nutriment‐poor soils that are dominated by Kalahari sands (World Wildlife Fund 2011). Subsistence slash‐and‐burn (chitemene) agriculture is practiced by a large proportion of the population, and as a consequence, the Miombo woodlands have been extensively deforested in the recent years.
Samples were collected in using Transwab® Amies swabs (MWETM – MW170) and sterilized 30‐mL screw‐capped glass bottle. Amies liquid transport swabs were taken from tree bark. Pairs of samples from bark and soil associated with each tree were also collected. Samples were collected and processed according to previously established protocols (Randhawa et al. 2005; Litvintseva et al. 2011), and the samples were kept at 4 °C until been processed on niger seed agar. All samples were collected under licence from the Zambian Wildlife Authority (ZAWA).
Isolation and identification of C. neoformans and Cryptococcus gattii
Niger seed medium is broadly used to identify both C. gattii and C. neoformans isolates as they produce melanin on the medium and can be distinguished from other brown yeast colonies from their characteristic dark brown colour (Staib 1987). Niger seed plates were incubated at 30 °C for 48 h. To obtain the niger seed medium, 70 g of niger seeds (Guizotia abyssinica) was pulverized and added to 1 L of distilled water. The mixture was autoclaved for 15 min. After allowing the solution to cool off, the solution was then filtered through using a triple layer of cheesecloth. One gram of glucose (Sigma‐Aldrich), 1 g of KH2PO4 (Sigma‐Aldrich), 0.78 g of creatinine (Sigma‐Aldrich) and 15 g of agar (Sigma‐Aldrich) were added to the niger seed extract. Distilled water was added accordingly to obtain 1 L of niger seed extract solution, and the medium was autoclaved a second time for 30 min. Before dispensing the mixture onto culture plates, 0.05 g of chloramphenicol (Sigma‐Aldrich) was resuspended in 1 mL of 95% ethanol and added to the niger seed medium to inhibit bacterial growth. The tubes containing soil or animal droppings were weighted to 0.5 g and distilled in 10 mL of distilled water. The samples were then further diluted to 1/10 and 1/100. Swabs were suspended in 1 mL of distilled water and diluted to 1/10. Then, 80 μL of both the undiluted and diluted samples was spread onto niger seed plates and each dilution was duplicated. Single colonies were isolated from plates and subcultured. During genomic DNA extraction, all purified single colonies were cultured for 60 h at 37 °C in a 50‐mL falcon tube containing in 5 mL of yeast protein digest (YPD) media supplemented with 0.5 m of NaCl at 250 rpm in order to reduce capsule size. The extraction process was performed using MasterPure Yeast DNA purification kit (Epicentre, UK) using a bead‐beating step with Mini‐Beadbeater‐16 (Cat. No. 607; Biospec). The genomic DNA was amplified by polymerase chain reaction (PCR) to distinguish between C. gattii and C. neoformans (Cn) using two sets of primers: CAP59, a capsular gene present in both Cn and C. gattii species, and SOD1, which is specific for Cn (Meyer et al. 2009).
To avoid contamination with filamentous fungi, individual yeast colony resembling C. neoformans (dark yellow/brown colonies) was isolated further onto another niger seed agar plate and on SDS agar. For each sample, a total of five individual dark brown colonies were isolated and their DNA was extracted. The extraction process was performed using MasterPure Yeast DNA purification kit (Epicentre) using an additional bead‐beating step. The genomic DNA was amplified by PCR to distinguish between C. gattii and C. neoformans using two sets of primers: CAP59, a capsular gene present in both Cryptococcus neoformans and C. gattii species, and SOD1, which is specific for C. neoformans (Meyer et al. 2009). A Pearson's chi‐squared test was used to assess potential differences in the proportions of C. gattii and Cn in the different ecoregions investigated. The test statistic was implemented in r software (v 3.1.1).
Mating types
The mating type for each environmental Cn isolates was determined using PCR analysis. Reactions were performed in a total volume of 25 μL and comprised of 0.8 μL of each of the following primers (10 μm), JOHE7264, JOHE7265, JOHE7270, JOHE7272 (de Oliveira et al. 2004). 0.5 μL of MgCl2 was added to the reaction mix with 0.5 of dNTPs (10 μm) (R0191; ThermoFisher), 2.5 μL of Rxn Buffer, 17.4 μL of H2O and 0.1 of Taq polymerase (cat: 18038018; ThermoFisher). Reaction conditions were as follows: JOHE7264/JOHE7265; JOHE7072/JOHE7272‐: Control method: calculated: (i) initial denaturation (94 °C for 3:00 min), (ii) denaturation (94 °C for 30 s), (iii) annealing (55 °C for 30 s), extension (72 °C for 1:00 min), (iv) go to (ii), (× 35), (v) final elongation (72 °C for 10:00 min) and 4 °C for infinity.
Fungal community structure using an ITS2 metabarcoding
The ITS region provides an ideal target to detect and delineate fungal species and has been extensively used for the taxonomic profiling of fungi (Xu 2006; Lindahl et al. 2013). In 2012, this genetic marker was proposed as the universal genetic barcode for fungi (Schoch et al. 2012). A comparison between ITS1 and ITS2 revealed that ITS2 was more variable and allowed a better assessment of the fungal diversity (Bazzicalupo et al. 2013). Therefore, the current study used ITS2 for metabarcoding sequencing. The ITS2 rDNA region was amplified using ITS3 KYO2 and ITS4 KYO3 (Toju et al. 2012). The two primers were paired with appropriate Illumina adapter overhang nucleotide sequences and a 2‐bp linker sequence. The ITS2 region of 61 samples taken from soil adjacent to sampled trees was amplified using an Illumina MiSeq instrument using a 300‐bp paired‐end sequencing run. Eleven samples which were found positive for the Cryptococcus species complex were included in the sample panel to assess whether the particular mycobiome was associated with the presence of Cryptococcus. Briefly, 0.25 g of soil was weighted and transferred to PowerBead Tubes (MO BIO PowerSoil DNA Isolation Kit). Amplifications were carried out in at total volume of 25 μL using 2.5 μL (5 ng/μL) of DNA, 12.5 μl 2× KAPA HiFi HotStart Ready Mix (ANACHEM, catalogue: KK2602) and 5 μL (1 μm) of each primer. PCR conditions for ITS3_KYO2/ITS4_KOY3: Control method – calculated: (i) initial denaturation (94 °C for 3:00 min), (ii) denaturation (94 °C for 30 s), (iii) annealing (55 °C for 30 s), extension (72 °C for 1:00 min), (iv) go to (ii), (× 25), (v) final elongation (72 °C for 5:00 min). Products were then kept at −20 °C until DNA precipitation step. Amplicons were visualized using Agilent 2200 TapeStation (Agilent Technologies, Inc). The libraries were then prepared according to a standard Illumina Protocol. Soil constitutes a highly complex environment that is mostly unexplored in sub‐Saharan Africa, and a better understanding of its structure and diversity is required to understand the fungal interaction with other organisms (Lim et al. 2010). For each sample, between 5 and 10 g of soil were collected in various ecoregions in Zambia during both the dry and rainy seasons. Soil was kept at 4 °C during field collection and frozen at −20 °C in the laboratory until being processed. DNA extraction and PCR amplification of the genomic DNA were extracted from 74 soil samples collected in various part of Zambia using the PowerSoil DNA Isolation Kit (cat. 12888‐100; MOBIO Laboratories, Inc.). A negative control was used to account for eventual taxa resulting from laboratory contamination. The genomic DNA was then purified and Illumina paired‐end libraries were prepared according to the Illumina protocol. The ITS2 region was amplified using an Illumina MiSeq instrument on a 300‐bp paired‐end sequencing run. Paired‐end libraries were prepared for sequencing in a two‐step PCR approach using the Illumina 16S Metagenomic Sequencing Library Preparation Protocol. The ITS2 rDNA region was amplified using ITS3_KYO2 and ITS4_KYO3 (Toju et al. 2012). The two primers were paired with appropriate Illumina adapter overhang nucleotide sequences and a 2‐bp linker sequence. The sequence of the two primers with linker and overhang sequences are detailed on Fig. S1.
Environmental variables
A number of environmental data were incorporated into the analysis to explore the relationship between environmental factors and the Cryptococcus species complex. Data were selected aiming to measure the operational environment of the saprophyte. Other studies which aimed to model the fundamental niche of the pathogen identified relevant climatic and spatial variables to study the distribution of Cryptococcus using a jackknifing procedure (Mak 2007; Mak et al. 2015). In the present study, the same environmental variables were explored. Cryptococcus is known to be sensitive to temperature and atmospheric conditions, and these variables are investigated using the WorldClim Global Climate Data (Busby 1991). Additionally, altitudinal influence, Normalized Difference Vegetation Index (NDVI) and Enhanced Vegetation Index (EVI) were also considered as they are commonly used as predicators in disease mapping (O'Hanlon et al. 2016). Environmental variables were extracted from WorldClim Global Climate Data (Busby 1991), and the soil information was found using the Africa Soil Information Service (Hengl et al. 2015). The information for each sample was extracted in r software (v 3.1.1) using the raster (Hijmans et al. 2012) and dismo (Hijmans et al. 2012) packages. The EVI and the NDVI from 2012 and 2014 were extracted and averaged over this period using MODISTools (Tuck et al. 2014).
Ecological niche modelling
Zambia provides an appropriate setting to model the ecological niches of C. neoformans and C. gattii. maxent is an species habitat modelling software, and it uses maximum entropy to model species’ geographic distribution (Phillips & Dudík 2008) based on presence‐only data. The software was used to generate the predicted distribution of the two sister species in Zambia. The distribution was modelled using Bioclim layers (Busby 1991) and altitude. The niche model produced by maxent is an approximation of the species’ ecological niche within the context of the environmental dimensions that we investigated. The purpose of the software advanced by Phillips et al. (2006) is to make predictions from incomplete information. maxent calculates the probability distribution of the maximum entropy. The latitude and longitude of the isolates constitute the sample points and the Bioclim layers the environmental features. To test model prediction, 25% of the samples were randomly set aside (Fig. S9, Supporting information).
Whole‐genome sequencing and SNP calling
High molecular weight DNA was extracted, and DNA libraries were prepared for Illumina HiSeq paired‐end sequencing. Our studies focused on C. neoformans as this pathogen represents the majority of cryptococcal meningitis cases in Zambia and, more widely, southern Africa. Environmental isolates collected from the environment were sequenced along with 23 clinical C. neoformans genomics acquired from an ACTA Lusaka Trial in Lusaka, Zambia. C. neoformans colonies were cultured in 5 mL of YPD media (Sigma‐Aldrich) supplemented with 0.5 m of NaCl for 60 h at 37 °C, shaking at 165 rpm. Forty‐seven environmental and clinical genomes from single‐colony isolates of Cryptococcus collected across Zambia were sequenced with average 100× coverage. DNA concentrations were measured with a Qubit® Broad Range dsDNA Assay (Life Technologies™) on Qubit® 2.0 Fluorometer (Q32866; Invitrogen™) according to the manufacturer's instructions. The genomic DNA was then diluted to 2 ng/μL in nuclease‐free water. Libraries were prepared using the TruSeq® Nano DNA Sample Preparation Kit (Illumina; FC‐121‐4001). Each time, 24 libraries were prepared for paired‐end sequencing on two lanes of Illumina® HiSeq to sequence 175‐bp fragments. Libraries were quantified by qPCR on an Applied Biosystems 7300 instrument (Life Technologies) using the Kapa library quantification kit (Kapa Biosciences, Boston, MA, USA) and the 2200 TapeStation (Agilent) with D100K ScreenTape assays (Agilent) as described in Rhodes et al. (2014). The information of these two assays was subsequently used to determine the dilution needed for normalizing each library to the same concentration and the pooling of each library. The libraries were then normalized to 10 nm and pooled together (12 samples/pool). The pooled libraries were sequenced by the Medical Research Council Clinical Genomics Centre (Hammersmith, London, UK) to a read length of 100 bp on an Illumina® HiSeq 2000 or 2500 sequencer. A total of 12 isolates were sequenced per lane of each flow cell, loading 16 pm of the pooled libraries. All raw reads and lineages information regarding each isolates have been submitted to the European Nucleotide Archive (MiSeq ITS project: PRJEB13820; WGS C. neoformans Project: PRJEB13814).
Reads were aligned to the Cryptococcus neoformans H99 reference (Loftus et al. 2005) using the short‐read alignment component of Burrows‐Wheeler Aligner (BWA) 0.75a aln (Li & Durbin 2009) using a quality threshold of 15 as described by Rhodes et al. (2014). FastQs were converted to SAM format using BWA and converted to BAM files, and the BAM files were then sorted and indexed with samtools version 0.1.18 (Li et al. 2009). Duplicated reads were marked with picard tools (v. 1.72; github.com/broadinstitute/picard). The resulting BAM files were recalibrated around insertions or deletions (INDELs) using the GATK RealignerTargetCreator and IndelRealigner (McKenna et al. 2010). The detection of single nucleotide polymorphisms (SNPs) and INDELs was called using GATK unifiedgenotyper version 2.2‐2 in haploid mode (DePristo et al. 2011; Van der Auwera et al. 2013). SNPs and INDELs were filtered to call only high‐confidence variants, according to whether they were present in 80% of reads. Mapped reads for each isolates are given on Table S9 (Supporting information).
Phylogeny and population assignment
Whole‐genome SNPs files were converted to Nexus and Phylip format. raxml, executing 1000 repaid bootstrap inferences with a generalized time reversible substitution matrix (Stamatakis 2006) was used to generate bootstrapped maximum‐likelihood trees over 1000 replicates, which were visualized in figtree version 1.4.0 (http://tree.bio.ed.ac.uk/software/figtree/). tempest v1.5 was used to root the phylogenetic tree (Fig. 2) and to ensure that a strict molecular clock could be applied (Rambaut et al. 2016). To first analyse the population structure of environmental Cryptococcus populations, we used ChromoPainter and fineStructure (Lawson et al. 2012). ChromoPainter constructs a coancestry matrix based on individual SNPs. Each individual is considered in terms of either being a donor or a recipient of ‘chunks’ of DNA. The coancestry matrix then records the inferred recombination events between each donor and recipient prior to coalescing with another genome.
Statistical analysis
ITS2 metabarcoding
Raw Illumina fastq files were merged into paired‐end reads using PANDASeq (Masella et al. 2012). Then, the clustering of ITS2 sequences, their taxonomic assignments and the analyses were performed using the quantitative insights into microbial ecology (qiime) software v1.8.0 (Caporaso et al. 2010). A chimera filtering step was performed using usearch v7 (Edgar 2010). The clustering and assignment of reads into operational taxonomic units (OTUs) were achieved using the OTUs‐picking workflow in qiime, and reads were grouped using usearch and uclust. These algorithms divide sequences into clusters using a 97% threshold of pairwise identity and a maximum e‐value of 0.001 (Edgar 2010). The OTU table was rarefied at a sequencing depth of 300 to remove sample heterogeneity. The UNITE fungal its database (v.12.9) (Abarenkov et al. 2010) was used to map OTUs to a reference and the identification of extracted ITS2 OTUs was performed using blast. A negative control was included and reads from the negative control were removed; these were assigned at the genus level to Trichosporon and others to ‘unidentified fungi’. α− and β‐diversity were calculated using qiime; α‐diversity is known as the species richness of a particular community (Whittaker 1960) and β‐diversity is the extent of change of community composition across space and time. Rarefaction curves are obtained from α‐diversity measures (observed OTUs and chaos1 estimate) and allowed to assess taxonomic richness of the samples (Magurran 2013). β‐Diversity was estimated using the Bray–Curtis metric. This method allows to calculate pairwise distances between fungal communities. Principal coordinate analysis were computed using the core‐analysis framework within qiime. To investigate potential differences in microbial β‐diversity between ecoregions or seasons, the analysis of similarity test (ANOSIM) and ADONIS with 999 permutations were used between the different categories based on Bray–Curtis dissimilarity.
ANOSIM is based on a standardized rank correlation analysis between two matrices. The test is commonly used in community ecology. ANOSIM examined the variation in species composition among different grouping factors. Samples are assigned to groups and ANOSIM test whether there are significant differences between these groups (Clarke 1993). A P‐value of 0.001 indicates significant differences between the groups at α = 0.05. An R coefficient superior to 0.25 implies that groups are different with some overlap. If the R coefficient exceeds 0.5, the groups are considered different (Fierer et al. 2010). ADONIS is similar to ANOSIM and is widely used in analysis of ecological community data. It is common practice in community ecology to combine these two statistics. The ADONIS method assesses the significance between sample groups based on a distance matrix. The analysis is similar to anova. The statistical significance is achieved by partitioning the sum of squares of the data set based on permutations and using Bray–Curtis matrices. Then, the method computes a R 2 coefficient which transcribes the percentage of variation explained by the categories and gives the statistical significance (Anderson 2001).
One‐way analysis of variance (anova) was used to identify taxa which differed between sample groups. A best variables rank correlation test (BEST) was then performed to rank the relative importance of environmental conditions on β‐diversity. The BEST analysis identifies subsets of variables whose Euclidian distances are maximally correlated with the Bray–Curtis matrix. The correlation is computed using a Spearman's rank correlation coefficient (Spearman 1904). Multivariate analysis of variance (manova) also known as ‘permutation anova’ was performed within the two main ecoregions (Miombo woodland and Zambezi Mopane) with 999 permutations. This method is similar to ADONIS as it partitions sums of squares. The method was used to assess the influences of the climatic factors within each ecoregion by returning a pseudo‐F value and a P‐value. Only OTUs present in more than 0.001% of the total filtered sequences were considered when comparing ecoregions and seasons. To highlight the taxa overlap within the different ecoregions, Venn diagrams were generated based on OTUs presence and visualized using http://bioinfogp.cnb.csic.es/tools/venny/index.html Oliveros (2007–2015). Taxonomic differences between samples of each ecoregion were tested using linear discriminant analysis (LDA) effect size (Segata et al. 2011). We first employed the factorial Kruskal–Wallis sum‐rank test (α = 0.05) to identify taxa with significant differential abundances between categories (ecoregion and season). Then, LDA effect size was applied to estimate the effect size of each differentially abundant taxa.
Population genetics
General population statistics were generated for VNI and VNB lineages of C. neoformans; the number of segregating sites (S), total number of mutations, number of singletons, nucleotide diversity (π), Waterson's estimator (θ), and Tajima's D and LD′ were estimated using variscan v.2.0 (Hutter et al. 2006) (Table S13, Supporting information). The nucleotide diversity measures the degree of polymorphism in a population (Nei & Li 1979). The Waterson estimator estimates the population mutation rate and decreases when the sample size or the recombination rate increase (Watterson 1975).
Results
Recovery of Cryptococcus in Zambia
We sampled the two main ecoregions of Zambia: Central Miombo woodlands (MW; n = 314 sites) and Zambezi Mopane Woodlands (ZM; n = 304 sites) (Table 1). A total of 1391 samples were collected from soil and trees across these Zambian ecoregions during both the dry (n = 773) and rainy (n = 618) seasons (Fig. S2, Supporting information). A total of 24 isolates from geographically distinct samples for Cryptococcus neoformans and 38 Cryptococcus gattii were identified. The species recovery rate of the Cryptococcus species complex was 5.2% in the rainy season and 3.9% in the dry season. The Zambezi Mopane ecoregion was most strongly associated with C. neoformans (19/24 isolates), whereas C. gattii was more strongly associated with the Central Miombo ecoregion (32/38 isolates); each of these species was significantly associated with their respective ecoregion (P < 0.001, Pearson's chi‐squared test).
Table 1.
Abbreviation | Name |
---|---|
Cn | Cryptococcus neoformans |
Cg | Cryptococcus gattii |
ZM | Zambezi Mopane Woodlands |
MW | Central Miombo woodlands |
WGS | Whole‐genome sequencing |
MLST | Multilocus sequence typing |
YPD | Yeast protein digest |
QIIME | Quantitative insights into microbial ecology |
OTUs | Operational taxonomic units |
PCoA | Principal coordinate analysis |
ANOSIM | Analysis of similarity test |
anova | Analysis of variance |
manova | Multivariate analysis of variance |
BEST | Best variables rank correlation test |
LDA | Linear discriminant analysis |
Association between environmental factors and fungal community structure
Modelling the ecological niches of the two sister species across Zambia aimed to predict the realized niche of C. neoformans and C. gattii. The predictive environmental niche model built using spatial and climatic variables (Fig. 1) showed that the ecological niche of C. neoformans is largely associated with the C. mopane tree belt, whereas the realized niche of C. gattii is largely within the wetter, higher altitude, Central Miombo woodlands. The environmental variables which gave the highest relative contribution to the model were for C. neoformans: Isothermality (bio3) accounting for 71.2% of the model variation and Precipitation Seasonality (bio15) accounting for 22.0%. With respect to C. gattii, the relevant climatic variables were Isothermality (bio3) 26.5%, ‘Precipitation of Wettest Quarter’ (bio16) 18.4%, ‘Precipitation of Warmest Quarter’ (bio18) 16.8% and ‘Mean Temperature of Driest Quarter’ (bio9) 7.9% (Table S1, Supporting information).
Fungal community structure assessed through high‐throughput barcoding was shown to vary significantly across the two ecoregions investigated (R ANOSIM = 0.3704, P < 0.001) (Table 2). Elevation, latitude and longitude improved regression coefficients showing that these factors are playing a significant role in the distribution of fungal communities where the Cryptococcus genus is embedded (Table S3, Supporting information). These variables were also significant when investigated using ADONIS, excepting for longitude (R 2 ADONIS = 0.0183, P < 0.198). The microbial compositions of the 11 positive samples were compared to nonpositive sites (n = 50) in an attempt to identify potential taxa associated with either C. gattii or C. neoformans. However, no significant associations could be detected (R ANOSIM = 0.0633, P < 0.263). We found that the abundance of fungal phyla varied between each ecoregion. Generally, the number of sequences per sample was higher in Zambezi Mopane Woodlands samples (Zambezi Mopane = 207 OTUs per sample, Miombo woodland = 149 OTUs per sample, P < 0.001). Across the Zambezi Mopane woodlands, Ascomycota represented 35.61% (n OTUs = 3356) of all OTUs, followed by Basidiomycota (11.15%, n OTUs = 1754). Across the Central Miombo woodlands samples, Ascomycota represented 32.69% (n OTUs = 2392) and Basidiomycota 11.20% (n OTUs = 1740). Figure 1a portrays the fungal community structure associated with each ecoregion and the complete distribution of OTUs among samples is given on Figs S4–S6 (Supporting information). A total of 1866 genera were identified and were found to be unevenly distributed across each ecoregion (Miombo woodland = 754, Zambezi Mopane = 1112). The fungal diversity across each of the different ecoregions investigated was compared (Fig. S3a, Supporting information), and rarefaction curves based on Chaos1 α‐diversity metric showed a saturation within the two ecoregions (Figs S3b and S6, Supporting information). This indicates that the OTUs identified in the survey accounted for a large proportion of the expected fungal diversity present in Zambian soil. Our analysis investigated samples over 900 km apart within Zambezi Mopane Woodlands and 400 km within Central Miombo woodlands, and no significant variation could be observed (Zambezi Mopane: RANOSIM = −0.008, P = 0.499; Miombo woodland: RANOSIM = 0.106, P = 0.189), indicating that soil fungal diversity appeared relatively uniform across space within each ecoregion. On one hand, season played a significant role in shaping the microbial diversity in the Zambezi Mopane ecoregion (R ANOSIM = 0.3610, P = 0.019) while, on the other hand, the patterns of fungal diversity in the Central Miombo woodlands appear to remain stable across seasons (R ANOSIM = 0.1062, P = 0.189) (Table S4, Supporting information). Linear discriminant analysis effective size revealed broad taxonomic trends in the mycobiome associated with the two ecoregions with the Tremellales (the order of wood‐rotting jelly fungi of which Cryptococcus is a member) being more abundant in the Zambezi Mopane Woodlands (Figs 1d, S7 and S8, Supporting information).
Table 2.
Bray–Curtis dissimilarity | |||||
---|---|---|---|---|---|
ANOSIM | ADONIS | ||||
Group | Factor | R | P | R 2 | P |
Central Miombo woodlands | Ecoregions | 0.4184 | 0.001 | 0.0428 | 0.001 |
Zambezi Mopane Woodlands | Ecoregions | 0.2726 | 0.001 | 0.0416 | 0.001 |
Zambezi Mopane Woodlands | Central Miombo woodlands | 0.3704 | 0.001 | 0.0687 | 0.001 |
Whole‐genome sequencing
For our phylogenetic analyses, owing to the predominance of pathogenic isolates being C. neoformans, we focused our attention on this species. Mapping Illumina reads (average reads mapped per isolate = 88%) for the 24 environmental and 23 clinical Zambian C. neoformans against the 19 Mb VNI H99 reference genome (Loftus et al. 2005) identified a total of 822 772 SNPs (Table S9, Supporting information). On average, reads were mapped at 88% to the reference. Phylogenetic analysis determined the occurrence of 21 VNI, 23 VNB and 3 VNII C. neoformans in our panel (Fig. 2). These data can be visualized in a Microreact project at https://microreact.org/project/S1lkajtY. The majority of the clinical isolates were VNI isolates (76%) showing that this molecular type dominates in human infections. Strikingly however, the large majority of environmental isolates (83%) belonged to lineage VNB. As shown in Fig. 2, the VNI, VNII and VNB molecular types clearly constitute separated lineages, supporting earlier work from southern Africa using multilocus sequence typing (MLST; Litvintseva et al. 2003, 2006, 2011; Simwami et al. 2011; Beale et al. 2015; Chen et al. 2015). These results were corroborated with fineStructure analysis (Fig. 3), where very limited genetic exchange was observed between the different molecular types. Among VNI (n = 24) and VNII isolates (n = 3), 105 391 and 366 086 SNPs were found, respectively. The VNB clade harboured much higher genetic diversity with a total of 521 924 SNPs being mapped (Table S10, Supporting information). Within VNB, two clearly defined subpopulations, VNB‐A and VNB‐B, were observed to subdivide this highly genetically diverse lineage (Figs 2 and 3). VNB‐A and VNB‐B were found to have 228 688 SNPs in common; 170 258 SNPs were private to VNB‐A and 188 273 were private to VNB‐B. All 14 isolates of VNB‐B were environmental in their origin, whereas four of nine isolates of VNB‐A were recovered from HIV‐AIDS patients.
Population genomics of environmental VNB isolates showed that these environmental isolates group into two statistically supported clades with 35% of VNB isolates possessing the MATa mating‐type locus. VNI isolates displayed a more clonal profile with the MATa mating‐type being much rarer (4%). VNI infections likely represent urban acquired infections, and VNB C. neoformans which are mainly present across the Colophospermum mopane region, likely reflect infection acquired in rural settings. Genetic diversity is higher in the VNB clade with the number of segregating sides (S), number of mutations (η), number of external mutations (ηE), number of nucleotide differences per site (π) greatly exceeding those found in VNI (Table S13, Supporting information). Among the VNB isolates, nine of the 23 isolates possessed the MATa locus (39%). Within the fineStructure analysis (Fig. 3), two VNB subpopulations could be observed, VNB‐A and VNB‐B. In the VN VNB‐A clade, five were positive for MATa, and six were positive for MATα locus (45% MATa). In the VNB‐B lineage, three isolates were MATa positive and seven possessed MATα (30% MATa). In the VNI clade, only one of the 21 isolates possessed the MATa locus (4%). The mating type of each isolate is listed on Table S9 (Supporting information). The VNB molecular type displayed a slightly negative Tajima's D value (Tajima's D = −0.265) indicating a surplus of rare alleles, whereas the VNI population had a strong positive value (Tajima's D = 0.884), indicating a sudden population contraction or balancing selection (Tajima 1989) (Table S13, Supporting information).
Discussion
We report here the first population genomic analysis of Cryptococcus neoformans, a fungus that is responsible for hundreds of thousands of deaths each year in Africa and which accounts for around 17% of AIDS‐associated mortality. Our use of high‐resolution WGS to analyse environmental and clinical populations of this pathogen has uncovered hitherto unmapped levels of genomic diversity, and we show that this diversity is deeply structured across even the scale of the single African country, Zambia, within which our study took place. These data have allowed the identification of three lineages of C. neoformans that occur within Zambia, describing highly divergent epidemiological patterns that reflect the environmental reservoirs of infection that our study defines.
Our genomic data extend the findings of Chen et al. (2015) who showed that the VNB molecular type is widely present in the arboreal environment, specifically the Mopane belt, across the neighbouring country of Botswana. The high recovery of VNB from Mopane trees across Zambia confirms the Zambezi Mopane Woodland ecoregion as the main niche for C. neoformans VNB. Strikingly, our phylogenomic analysis showed that VNB is not only over fivefold more genetically diverse than the pan‐global lineage VNI, but also contained two strongly supported subclades, here named VNB‐A and VNB‐B. A previous report by our group in South Africa showed that patients infected with the VNB molecular type had significantly worse survival than those infected with VNI (Beale et al. 2015). To which subclade of VNB the South African virulent clinical isolates belong is not currently known, as is the relative virulence of Zambian clinical VNB‐A compared to their environmental VNB‐B or clinical VNI counterparts. These analyses await the cessation of the ACTA Lusaka Trial and subsequent analysis of the trial data in conjunction with a larger panel of sequenced isolates from across the full longitudinal spectrum of samples recovered from Zambian patients.
Over three‐quarters of clinical infections in our Zambian cohort were caused by the VNI molecular type. We found a similar pattern to that described by Chen et al. (2015) in Botswana, where VNI was associated with clinical infections in urban areas and VNB was found strongly associated with Mopane trees in rural settings. In both studies, the VNII molecular type was found to be rare. Although it is not possible to infer where patients in the trial acquired their infections due to the potential for reactivation of latent infection in the setting of profound immunosuppression, taken together our studies support the hypothesis that intensified urbanization across southern Africa has led to high densities of domestic pigeons, Columba livia, which have led to an amplification of VNI‐related infections in cities such as Lusaka and Gaborone (Litvintseva et al. 2011; Simwami et al. 2011). We find that 96% of VNI isolates are the MATα mating type, a finding that highlights the clonal profile of the VNI molecular type compared to VNB isolates which displayed a high proportion (36%) of the MATa mating type. The current understanding of the evolutionary history of VNI derived from MLST analyses is that the low‐diversity, pan‐global, nature of its genotype reflects a global dissemination alongside the domestication of pigeons within recent history (Litvintseva et al. 2011; Simwami et al. 2011). Our genomic data support this theory by confirming that VNI is genetically depauperate compared to both VNII and VNB. Further understanding of the evolutionary epidemiology of these lineages within their global context awaits the broad‐scale phylogenomic studies that will inevitably follow our studies lead.
Across the broader species complex, we show that C. neoformans and its sister species, C. gattii, were associated with different ecotypes in Zambia. This shows that the evolutionary separation of these two species, which speciated around 37 Ma (Xu et al. 2000), has resulted in their adaptation to occupy markedly different ecological niches. C. neoformans was predominantly recovered from trees, mainly Colophospermum mopane (49% of sampled trees) that dominated the Zambezi Mopane ecoregion. In comparison, C. gattii was found predominately in the Central Miombo Woodlands, where 32 of the 38 isolates (84%) were isolated. The dominant tree type in the Miombo woodlands is Brachystegia sp., and six C. gattii isolates (16%) were recovered from these trees (Table S12, Supporting information). Two isolates of C. gattii were also found on Eucalyptus, a tree species which has been associated with C. gattii for decades (Ellis & Pfeiffer 1990; Pfeiffer & Ellis 1992). Interestingly, we also found that C. gattii was strongly associated with hyrax faeces (15 isolates in 52 samples). This finding identifies a new potential animal reservoir for C. gattii and emphasizes how ubiquitous this pathogen is. Additional sampling will lead to better understand the evolutionary history of the Cryptococcus species complex in Africa, especially in relation to their association with potential vertebrate hosts.
We used ecological niche modelling to further study the abiotic niche requirements for C. gattii and C. neoformans. These analyses confirmed the influence of climate on the distribution of the two species by predicting their environmental tolerances (Kearney & Porter 2004). Projections by maxent closely matched the two main ecoregions; the Central Miombo woodlands for C. gattii and the ZM for C. neoformans. Relevant climatic variables explaining C. neoformans distribution under our maxent model were also identified in our manova (Table S5, Supporting information) and best variables rank correlation test (BEST) approach (Appendix S1) with isothermality and precipitation seasonality accounting for most of the model variation, describing 71% and 22% of C. neoformans distribution, respectively. Precipitation clearly plays a significant role in predicting the distribution of C. gattii with five precipitation‐associated data layers contributing to our environmental model by more than 58% compared to only 25% for C. neoformans. Together, these niche models show that C. neoformans is associated with lower altitude and drier regions of Zambia (which is where C. mopane predominates) whereas C. gattii has a predilection for higher altitude and wetter environs (which where the Miombo Brachystegia‐dominated woodlands occur). More generally, for other pathogens, altitude is known to be a strong predictor of their ecotope (Messenger et al. 2015). Alongside altitude being a strong predictor of factors such as rainfall, altitude can also act as a physical barrier which limits gene flow and the dispersal of populations, leading to genetic isolation and speciation (Losos & Glor 2003). Investigating the factors associated with speciation may have practical implications on the potential for Cryptococcus species to adapt to future environments, for example, by estimating these pathogens response to climate change (Thomas et al. 2012).
In their natural environment, species of Cryptococcus do not exist in isolation and are embedded within a rich community of fungi, which are one of the most diverse groups of organisms on earth (Tedersoo et al. 2014). Understanding fungal community structure in the context of the distribution of Cryptococcus is important. This is because competitive interactions will, alongside the abiotic factors that we have here described, underlie the ability of Cryptococcus to proliferate as well as to invade and establish within ecoregions. Until recently, however, the sheer diversity of fungal species on earth has overwhelmed our ability to characterize their richness in nature. Here we show, using high‐throughput fungal barcoding, that ecological niche not only impacts on the distribution of the pathogenic species of Cryptococcus that we studied but also more broadly across the fungal kingdom in Zambia. We found that fungal community structure differed substantially across ecoregions and environmental conditions were found to be highly predictive of this structure, both seasonally and spatially (Fig. 1). Spatial and climatic factors are known to delineate organisms’ fundamental niche and therefore affect their distribution (Kozak et al. 2008). In support of this β−diversity (dissimilarity) analyses revealed that a fundamentally different fungal community composition occurred between the Mopane‐ and Miombo‐dominated ecoregions.
Due to the increasing movement of fungi globally, the risk of outbreaks from environmental pathogens is increasing; therefore, understanding the biotic and abiotic factors associated with such pathogens will help to foresee potential changes to their distribution and to potentially predict the emergence of disease (Wolfe et al. 2007; Fisher et al. 2012). Understanding the ecological adaptations that underpin fungal distributions will help towards a better understanding of biological mechanisms governing disease emergence (Giraud et al. 2010). The association of C. gattii, C. neoformans and the lineages therein, with different biotic and abiotic factors in Zambia underscores the adaptation of these species to different environments. Our study represents the first attempt to understand the broader community structure that is associated with a fungal pathogen species complex. We argue that understanding the ecological structuring and life‐history attributes of these pathogens are required to understand their potential to adapt to new environments, either climatic or host. Predicting disease distributions is likely to become more accurate in the near future as microbial assemblages from novel environments are becoming increasingly characterized in Big Data projects, such as the Earth Microbiome Project (http://www.earthmicrobiome.org/). Integrating macroecological analyses with population genomic data in order to link ecosystem‐level patterns to local‐scale epidemiological predictions is the ultimate goal of studies such as that which we have described here.
M.V. wrote the manuscript and performed the analyses with the help of J.R. and M.A.B. using the laboratory facilities of M.C.F., T.B. and T.S.H. implemented the clinical trial with the help of S.M., N.L., N.K., D.C., S.L. and G.K. who collected the clinical Cryptococcus neoformans isolates.
Data accessibility
All raw reads and lineages information regarding each isolates have been submitted to the European Nucleotide Archive (MiSeq ITS project: PRJEB13820; WGS Cn Project: PRJEB13814). MiSeq ITS sample names can be found on Table S2 and isolates for WGS are listed on Table S9 (Supporting information). We welcome inquiries from all parties regarding access to cryptococcal strains or fungal sequences.
Supporting information
Acknowledgements
MV was supported by a UK Natural Environmental Research Council PhD studentship. JR and MAB were supported by a UK Medical Research Council Grant to MCF, TH and TB. We thank Michael Fisher, Ian Bruce‐Miller, Emma Bruce‐Miller and Mark Harvey for invaluable support in the field.
Contributor Information
Mathieu Vanhove, Email: m.vanhove12@imperial.ac.uk.
Matthew C. Fisher, Email: matthew.fisher@imperial.ac.uk
References
- Abarenkov K, Henrik Nilsson R, Larsson K et al (2010) The UNITE database for molecular identification of fungi – recent updates and future perspectives. New Phytologist, 186, 281–285. [DOI] [PubMed] [Google Scholar]
- Anderson MJ (2001) A new method for non‐parametric multivariate analysis of variance. Austral Ecology, 26, 32–46. [Google Scholar]
- Bazzicalupo AL, Bálint M, Schmitt I (2013) Comparison of ITS1 and ITS2 rDNA in 454 sequencing of hyperdiverse fungal communities. Fungal Ecology, 6, 102–109. [Google Scholar]
- Beale MA, Sabiiti W, Robertson EJ et al (2015) Genotypic diversity is associated with clinical outcome and phenotype in Cryptococcal Meningitis across Southern Africa. PLoS Neglected Tropical Diseases, 9, e0003847. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Busby J (1991) BIOCLIM ‐ a bioclimate analysis and prediction system. Plant Protection Quarterly, 6, 8–9. [Google Scholar]
- Caporaso JG, Kuczynski J, Stombaugh J et al (2010) QIIME allows analysis of high‐throughput community sequencing data. Nature Methods, 7, 335–336. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Casadevall A, Perfect JR (1998) Cryptococcus neoformans. ASM Press, Washington, District of Columbia. [Google Scholar]
- Casadevall A, Steenbergen JN, Nosanchuk JD (2003) “Ready made” virulence and “dual use” virulence factors in pathogenic environmental fungi – the Cryptococcus neoformans paradigm. Current Opinion in Microbiology, 6, 332–337. [DOI] [PubMed] [Google Scholar]
- Chayakulkeeree M, Perfect JR (2008) Cryptococcosis In: Diagnosis and Treatment of Human Mycoses (ed. Hospenthal DR, Rinaldi MG.), pp. 255–276. Springer, New York. [Google Scholar]
- Chen Y, Litvintseva AP, Frazzitta AE et al (2015) Comparative analyses of clinical and environmental populations of Cryptococcus neoformans in Botswana. Molecular Ecology, 24, 3559–3571. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chidumayo EN (2013) Forest degradation and recovery in a miombo woodland landscape in Zambia: 22 years of observations on permanent sample plots. Forest Ecology and Management, 291, 154–161. [Google Scholar]
- Clarke KR (1993) Non‐parametric multivariate analyses of changes in community structure. Australian Journal of Ecology, 18, 117. [Google Scholar]
- Connell JH (1961) Effects of competition, predation by Thais lapillus, and other factors on natural populations of the barnacle Balanus balanoides . Ecological Monographs, 31, 61–104. [Google Scholar]
- DePristo MA, Banks E, Poplin R et al (2011) A framework for variation discovery and genotyping using next‐generation DNA sequencing data. Nature Genetics, 43, 491–498. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Desalermos A, Kourkoumpetis TK, Mylonakis E (2012) Update on the epidemiology and management of cryptococcal meningitis. Expert Opinion on Pharmacotherapy, 13, 783–789. [DOI] [PubMed] [Google Scholar]
- Desnos‐Ollivier M, Patel S, Spaulding AR et al (2010) Mixed infections and in vivo evolution in the human fungal pathogen Cryptococcus neoformans . MBio, 1, e00091–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Edgar RC (2010) Search and clustering orders of magnitude faster than BLAST. Bioinformatics, 26, 2460–2461. [DOI] [PubMed] [Google Scholar]
- Ellis DH, Pfeiffer TJ (1990) Natural habitat of Cryptococcus neoformans var. gattii . Journal of Clinical Microbiology, 28, 1642–1644. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fierer N, Lauber CL, Zhou N et al (2010) Forensic identification using skin bacterial communities. Proceedings of the National Academy of Sciences, USA, 107, 6477–6481. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fisher MC, Henk DA, Briggs CJ et al (2012) Emerging fungal threats to animal, plant and ecosystem health. Nature, 484, 186–194. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Giraud T, Gladieux P, Gavrilets S (2010) Linking the emergence of fungal plant diseases with ecological speciation. Trends in Ecology & Evolution, 25, 387–395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Granados DP, Castañeda E (2005) Isolation and characterization of Cryptococcus neoformans varieties recovered from natural sources in Bogotá, Colombia, and study of ecological conditions in the area. Microbial Ecology, 49, 282–290. [DOI] [PubMed] [Google Scholar]
- Hagen F, Khayhan K, Theelen B et al (2015) Recognition of seven species in the Cryptococcus gattii/Cryptococcus neoformans species complex. Fungal Genetics and Biology, 78, 16–48. [DOI] [PubMed] [Google Scholar]
- Hardin G (1960) The competitive exclusion principle. Science, 131, 1292–1297. [DOI] [PubMed] [Google Scholar]
- Hengl T, Heuvelink GBM, Kempen B et al (2015) Mapping soil properties of Africa at 250 m resolution: random forests significantly improve current predictions. PLoS One, 10, e0125814. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hijmans RJ, Phillips S, Leathwick J, Elith J (2012) dismo: Species Distribution Modeling. R package version 0.7‐17. http://dssm.unipa.it/CRAN/web/packages/dismo/ [Google Scholar]
- Hiremath SS, Chowdhary A, Kowshik T et al (2008) Long‐distance dispersal and recombination in environmental populations of Cryptococcus neoformans var. grubii from India. Microbiology, 154, 1513–1524. [DOI] [PubMed] [Google Scholar]
- Hoang LMN, Maguire JA, Doyle P, Fyfe M, Roscoe DL (2004) Cryptococcus neoformans infections at Vancouver Hospital and Health Sciences Centre (1997–2002): epidemiology, microbiology and histopathology. Journal of Medical Microbiology, 53, 935–940. [DOI] [PubMed] [Google Scholar]
- Hutchinson GE (1959) Homage to Santa Rosalia or why are there so many kinds of animals? The American Naturalist, 93, 145–159. [Google Scholar]
- Hutter S, Vilella AJ, Rozas J (2006) Genome‐wide DNA polymorphism analyses using VariScan. BMC Bioinformatics, 7, 409. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Johnson MTJ, Stinchcombe JR (2007) An emerging synthesis between community ecology and evolutionary biology. Trends in Ecology & Evolution, 22, 250–257. [DOI] [PubMed] [Google Scholar]
- Kearney M, Porter WP (2004) Mapping the fundamental niche: physiology, climate, and the distribution of a nocturnal lizard. Ecology, 85, 3119–3131. [Google Scholar]
- Kidd SE, Hagen F, Tscharke RL et al (2004) A rare genotype of Cryptococcus gattii caused the cryptococcosis outbreak on Vancouver Island (British Columbia, Canada). Proceedings of the National Academy of Sciences, USA, 101, 17258–17263. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kozak KH, Graham CH, Wiens JJ (2008) Integrating GIS‐based environmental data into evolutionary biology. Trends in Ecology & Evolution, 23, 141–148. [DOI] [PubMed] [Google Scholar]
- Lawson DJ, Hellenthal G, Myers S, Falush D (2012) Inference of population structure using dense haplotype data. PLoS Genetics, 8, e1002453. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows‐Wheeler transform. Bioinformatics, 25, 1754–1760. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H, Handsaker B, Wysoker A et al (2009) The sequence alignment/map format and SAMtools. Bioinformatics, 25, 2078–2079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lim YW, Kwon Kim B, et al. (2010) Assessment of soil fungal communities using pyrosequencing. The Journal of Microbiology, 48, 284–289. [DOI] [PubMed] [Google Scholar]
- Lindahl BD, Nilsson RH, Tedersoo L et al (2013) Fungal community analysis by high‐throughput sequencing of amplified markers – a user's guide. New Phytologist, 199, 288–299. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Litvintseva AP, Marra RE, Nielsen K et al (2003) Evidence of sexual recombination among Cryptococcus neoformans serotype A isolates in sub‐Saharan Africa. Eukaryotic Cell, 2, 1162–1168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Litvintseva AP, Kestenbaum L, Vilgalys R, Mitchell TG (2005) Comparative analysis of environmental and clinical populations of Cryptococcus neoformans . Journal of Clinical Microbiology, 43, 556–564. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Litvintseva AP, Thakur R, Vilgalys R, Mitchell TG (2006) Multilocus sequence typing reveals three genetic subpopulations of Cryptococcus neoformans var. grubii (serotype A), including a unique population in Botswana. Genetics, 172, 2223–2238. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Litvintseva AP, Carbone I, Rossouw J et al (2011) Evidence that the human pathogenic fungus Cryptococcus neoformans var. grubii may have evolved in Africa. PLoS One, 6, e19688. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Loftus BJ, Fung E, Roncaglia P et al (2005) The genome of the basidiomycetous yeast and human pathogen Cryptococcus neoformans . Science, 307, 1321–1324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Losos JB, Glor RE (2003) Phylogenetic comparative methods and the geography of speciation. Trends in Ecology & Evolution, 18, 220–227. [Google Scholar]
- Magurran AE (2013) Measuring Biological Diversity. Blackwell Science, Oxford, UK. ISBN: 978‐0‐632‐05633‐0. [Google Scholar]
- Mak SY (2007) Ecological Niche Modeling of Cryptococcus gattii in British Columbia (T). University of British Columbia: https://open.library.ubc.ca/cIRcle/collections/831/items/1.0093069 (Original work published 2007) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mak S, Vélez N, Castañeda E, Escandón P (2015) The fungus among us: Cryptococcus neoformans and Cryptococcus gattii ecological modeling for Colombia. Journal of Fungi, 1, 332–344. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Masella AP, Bartram AK, Truszkowski JM, Brown DG, Neufeld JD (2012) PANDAseq: paired‐end assembler for illumina sequences. BMC Bioinformatics, 13, 31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McKenna A, Hanna M, Banks E et al (2010) The genome analysis toolkit: a MapReduce framework for analyzing next‐generation DNA sequencing data. Genome Research, 20, 1297–1303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Messenger LA, Garcia L, Vanhove M et al (2015) Ecological host fitting of Trypanosoma cruzi TcI in Bolivia: mosaic population structure, hybridization and a role for humans in Andean parasite dispersal. Molecular Ecology, 24, 2406–2422. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Meyer W, Aanensen DM, Boekhout T et al (2009) Consensus multi‐locus sequence typing scheme for Cryptococcus neoformans and Cryptococcus gattii . Medical Mycology, 47, 561–570. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nei M, Li W‐H (1979) Mathematical model for studying genetic variation in terms of restriction endonucleases. Proceedings of the National Academy of Sciences, USA, 76, 5269–5273. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nielsen K, De Obaldia AL, Heitman J (2007) Cryptococcus neoformans mates on pigeon guano: implications for the realized ecological niche and globalization. Eukaryotic Cell, 6, 949–959. [DOI] [PMC free article] [PubMed] [Google Scholar]
- O'Hanlon SJ, Slater HC, Cheke RA et al (2016) Model‐based geostatistical mapping of the prevalence of Onchocerca volvulus in West Africa. PLoS Neglected Tropical Diseases, 10, e0004328. [DOI] [PMC free article] [PubMed] [Google Scholar]
- de Oliveira MTB, Boekhout T, Theelen B et al (2004) Cryptococcus neoformans shows a remarkable genotypic diversity in Brazil. Journal of Clinical Microbiology, 42, 1356–1359. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pfeiffer TJ, Ellis DH (1992) Environmental isolation of Cryptococcus neoformans var. gattii from Eucalyptus tereticornis . Medical Mycology, 30, 407–408. [PubMed] [Google Scholar]
- Phillips SJ, Dudík M (2008) Modeling of species distributions with Maxent: new extensions and a comprehensive evaluation. Ecography, 31, 161–175. [Google Scholar]
- Phillips SJ, Anderson RP, Schapire RE (2006) Maximum entropy modeling of species geographic distributions. Ecological Modelling, 190, 231–259. [Google Scholar]
- Rambaut A, Lam TT, Carvalho LM, Pybus OG (2016) Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path‐O‐Gen). Virus Evolution, 2, vew007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Randhawa HS, Kowshik T, Khan ZU (2005) Efficacy of swabbing versus a conventional technique for isolation of Cryptococcus neoformans from decayed wood in tree trunk hollows. Medical Mycology, 43, 67–71. [DOI] [PubMed] [Google Scholar]
- Rhodes J, Beale MA, Fisher MC (2014) Illuminating choices for library prep: a comparison of library preparation methods for whole genome sequencing of Cryptococcus neoformans using Illumina HiSeq. PLoS One, 9, e113501. doi: 10.1371/journal.pone.0113501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schoch CL, Seifert KA, Huhndorf S et al (2012) Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA barcode marker for Fungi. Proceedings of the National Academy of Sciences, USA, 109, 6241–6246. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Segata N, Izard J, Waldron L et al (2011) Metagenomic biomarker discovery and explanation. Genome Biology, 12, R60. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simwami SP, Khayhan K, Henk DA et al (2011) Low diversity Cryptococcus neoformans variety grubii multilocus sequence types from Thailand are consistent with an ancestral African origin. PLoS Pathogens, 7, e1001343. doi: 10.1371/journal.ppat.1001343. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Spearman C (1904) The proof and measurement of association between two things. The American Journal of Psychology, 15, 72–101. [PubMed] [Google Scholar]
- Staib F (1987) Cryptococcosis in the acquired immunodeficiency syndrome; mycological‐diagnostic and epidemiological observations. AIDS‐Forschung (AIFO), 7, 363‐382. [Google Scholar]
- Stamatakis A (2006) RAxML‐VI‐HPC: maximum likelihood‐based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics, 22, 2688–2690. [DOI] [PubMed] [Google Scholar]
- Tajima F (1989) Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics, 123, 585–595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tedersoo L, Bahram M, Põlme S et al (2014) Global diversity and geography of soil fungi. Science, 346, 1256688. [DOI] [PubMed] [Google Scholar]
- Thomas JC, Godfrey PA, Feldgarden M, Robinson DA (2012) Candidate targets of balancing selection in the genome of Staphylococcus aureus . Molecular Biology and Evolution, 29, 1175–1186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Toju H, Tanabe AS, Yamamoto S, Sato H (2012) High‐coverage ITS primers for the DNA‐based identification of ascomycetes and basidiomycetes in environmental samples. PLoS One, 7, e40863. doi: 10.1371/journal.pone.0040863. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tuck SL, Phillips HRP, Hintzen RE et al (2014) MODISTools–downloading and processing MODIS remotely sensed data in R. Ecology and Evolution, 4, 4658. [DOI] [PMC free article] [PubMed] [Google Scholar]
- UNAIDS JUNP on H (2015) Global Report: Joint United Nations Programme on HIV/AIDS (UNAIDS) (2013). UNAIDS, Geneva, 2013. According to the UNAIDS’ estimate the number of new infections in the region increased from, 21, 0–22. ISBN: 978‐92‐9253‐032‐7. [Google Scholar]
- Van der Auwera GA, Feldgarden M, Kolter R, Mahillon J (2013) Whole‐genome sequences of 94 environmental isolates of Bacillus cereus sensu lato. Genome Announcements, 1, e00380‐13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Van Wyk M, Govender NP, Mitchell TG, Litvintseva AP (2014) Multilocus sequence typing of serially collected isolates of Cryptococcus from HIV‐infected patients in South Africa. Journal of Clinical Microbiology, 52, 1921–1931, JCM–03177. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Watterson GA (1975) On the number of segregating sites in genetical models without recombination. Theoretical Population Biology, 7, 256–276. [DOI] [PubMed] [Google Scholar]
- White F (1983) The Vegetation of Africa, a Descriptive Memoir to Accompany the UNESCO/AETFAT/UNSO Vegetation Map of Africa (3 Plates, Northwestern Africa, Northeastern Africa, and Southern Africa, 1:5,000,000). UNESCO, Paris. ISBN 9231019554. [Google Scholar]
- Whittaker RH (1960) Vegetation of the Siskiyou mountains, Oregon and California. Ecological Monographs, 30, 279–338. [Google Scholar]
- Wiesner DL, Moskalenko O, Corcoran JM et al (2012) Cryptococcal genotype influences immunologic response and human clinical outcome after meningitis. MBio, 3, e00196‐12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wolfe ND, Dunavan CP, Diamond J (2007) Origins of major human infectious diseases. Nature, 447, 279–283. [DOI] [PMC free article] [PubMed] [Google Scholar]
- World Wildlife Fund (2011) Ecoregions of Zambia. http://wwf.panda.org
- Xu J (2006) Fundamentals of fungal molecular population genetic analyses. Current Issues in Molecular Biology, 8, 75. [PubMed] [Google Scholar]
- Xu J, Vilgalys R, Mitchell TG (2000) Multiple gene genealogies reveal recent dispersion and hybridization in the human pathogenic fungus Cryptococcus neoformans . Molecular Ecology, 9, 1471–1481. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All raw reads and lineages information regarding each isolates have been submitted to the European Nucleotide Archive (MiSeq ITS project: PRJEB13820; WGS Cn Project: PRJEB13814). MiSeq ITS sample names can be found on Table S2 and isolates for WGS are listed on Table S9 (Supporting information). We welcome inquiries from all parties regarding access to cryptococcal strains or fungal sequences.