SUMMARY
Isolates of Cryptococcus neoformans, a fungal pathogen that kills 112,000 people each year, differ from a 19-megabase reference genome at a few thousand up to almost a million DNA sequence positions. We used bulked segregant analysis and association analysis, genetic methods that require no prior knowledge of sequence function, to address the key question of which naturally occurring sequence variants influence fungal virulence. We identified a region containing such variants, prioritized them, and engineered strains to test our findings in a mouse model of infection. At one locus, we identified a 4-nt variant in the PDE2 gene that occurs in common laboratory strains and severely truncates the encoded phosphodiesterase. The resulting loss of phosphodiesterase activity significantly impacts virulence. Our studies demonstrate a powerful and unbiased strategy for identifying key genomic regions in the absence of prior information and provide significant sequence and strain resources to the community.
Keywords: Cryptococcus neoformans, bulked segregant analysis, natural sequence variants, Pde2
Graphical Abstract

eTOC
Agustinho et al. identified naturally occurring genome sequence variants that influence virulence of the fungal pathogen Cryptococcus neoformans, using genetic strategies that required no prior knowledge of sequence function. This offers a powerful approach for discovery of sequences relevant to virulence or other phenotypes of interest. [46]
INTRODUCTION
The pathogenic fungus Cryptococcus neoformans causes lethal meningoencephalitis that kills 112,000 people worldwide each year 1. The availability of genome sequence, beginning almost twenty years ago 2, has propelled fundamental research on this pathogen forward. This information has served as a tremendous resource for scientific investigation and as the scaffold for high-impact genome-scale resources, from microarrays used in early expression analysis3 to deletion libraries.4–6
To decipher cryptococcal virulence, the research community has deployed multiple genetic approaches. Individual genes have been studied by perturbing specific loci of interest, while mutant collections have enabled phenotypic screening. Other approaches rely on crossing parental strains that differ in phenotype or phylogeny. For example, quantitative trait locus (QTL) analysis of recombinant populations has been used to broadly map genomic regions associated with key phenotypes 7,8. In parallel with these studies, which have been based on reference strains, large scale sequencing of cryptococcal isolates has progressed rapidly 9.
Thousands of C. neoformans isolates from clinical or environmental sources have been used to elucidate C. neoformans evolution 9–21 and in efforts to correlate disease outcome with in vitro measures such as virulence factor production or fungal growth 13,22–26. It is clear that distinct strain lineages are associated with varied clinical outcomes 13,18,21,27–32. Furthermore, natural genomic sequence variation has been associated with levels of virulence, both in human infection and in mouse studies 28,29,33,34. However, the identification of specific sequence variants that causally influence the virulence of clinical strains has remained a considerable challenge.
We have developed and validated an unbiased, genetic strategy to identify naturally occurring sequence variants that impact C. neoformans virulence. Our whole-genome approach can potentially reveal key variants in novel genes, directing research attention to their products. It can also highlight specific variants in genes that have already been shown to impact virulence, which can lead to mechanistic understanding of the encoded proteins. Additionally, and in contrast to studies based on gene deletion 4, our strategy can identify critical variants in essential genes, regulatory sequences, and non-annotated or mis-annotated regions of the genome.
Valuable information relating fungal genotype to disease outcome has been derived from strains isolated from patients with cryptococcosis and their accompanying clinical records. One challenge of such studies is that complex host factors contribute to outcome, including patient genotype, known and unknown comorbidities, treatment, and healthcare setting 35. This complexity limits the power of these analyses. To circumvent this, we used mouse models of infection to assess the virulence of strains derived from clinical isolates. This approach is supported in the literature, which shows a strong correlation between mortality in humans and mice infected with the same C. neoformans strain 27. Another challenge for our plans to exploit genetic analysis is the large haplotype blocks observed in clinical isolates, a result of limited recombination in the wild 14,21. To address this, we have taken advantage of the sexual cycle of C. neoformans 36,37, which allows us to generate recombinant progeny for study.
We applied genetic approaches to analyze recombinant progeny derived from a cross between a well characterized and highly virulent laboratory strain (KN99 38) and a clinical strain that exhibits low virulence in our mouse model (C8 39). Excitingly, our unbiased approach efficiently identified individual causal variants responsible for virulence differences among C. neoformans isolates. We identified and experimentally validated sequence variants that both increase and decrease virulence, showing that even relatively less pathogenic strains harbor variants that increase virulence. We also showed that the phosphodiesterase Pde2 lacks activity in commonly used C. neoformans laboratory strains, significantly reducing their virulence.
RESULTS
The first step in our strategy was to select clinical isolates for genetic crosses. For one parent, we chose C. neoformans strain KN99a, which is the congenic partner of KN99α 38, a reference genome strain for C. neoformans 40. The KN99 strains, which are derived from the clinical isolate H99 41, are highly virulent in animal models and mate robustly 38. For the second parent we wanted a clinical isolate that differed significantly from KN99 in terms of genome sequence and virulence, but also mated well, which is not typical of these isolates. To identify such a strain among the 73 clinical isolates that our laboratory had on hand, we first compared their genome sequences (either obtained online or sequenced in-house; see Methods and Supplemental Table 1, sheet A) to that of KN99α. This analysis yielded a total of 1,072,542 distinct genomic variants relative to KN99. (Here we use variants to refer to nucleotide substitutions or indels shorter than 50 bp.)
Even when patient data is available, it is challenging to directly compare the virulence of clinical isolates because of the confounding host factors mentioned above. For this reason, we used an animal model of cryptococcal infection to compare clinical strains to the laboratory strain KN99. To do this, we infected mice intranasally and assessed colony forming units (CFU) in the lung nine days after infection as a proxy for virulence. We observed a 14,000-fold range in lung burden between clinical isolates, with strains that were both more and less virulent than KN99a (Figure 1, top).
Figure 1.

Lung burden after cryptococcal infection. Individual C57BL/6J mice were intranasally inoculated with 12,500 cryptococcal cells of KN99a, C8 (orange bar), or other strains of interest. Lung burden was measured 9 days post-infection by colony forming units (CFU), which are plotted for each strain relative to the value for KN99a (mean and SD of technical triplicates). Top, 73 clinical isolates. Bottom, 93 recombinants derived by crossing KN99a and C8.
Almost all of the clinical strains we examined, like most cryptococcal isolates, are mating type alpha 42. To evaluate their ability to produce mating structures, we crossed them to KN99a on V8 medium 43. Of the 73 strains, 34 exhibited some level of filamentation under our standard conditions (see Methods). We used microdissection to collect spores from the four crosses that showed the highest filamentation in standard conditions and sequenced at least five progeny strains from each. Two of the crosses produced progeny nearly identical to one of the parents (>99% of alleles in common), possibly generated by the recently described process of pseudosexual reproduction 44. The other two generated the recombinant progeny expected from sexual reproduction; an example is depicted in Figure 2.
Figure 2.

Representation of the whole genome sequence of one recombinant strain derived from a cross between C8 and KN99a. Each bar represents one of the 14 chromosomes of C. neoformans. Orange, haplotypes derived from C8; blue, haplotypes derived from KN99a.
We selected clinical strain C8 39 (genome sequence accession SRX189616) for our studies, based on its virulence, robust mating, and production of recombinant progeny when crossed to KN99a. This VNI subtype strain was isolated from the cerebrospinal fluid of an HIV+ cryptococcosis patient in the United States 39. It has 48,934 genomic variants (45,343 SNVs and 3,591 indels) compared to the KN99α reference sequence, which are distributed throughout the genome. Notably, its virulence in our mouse model is ~100 times lower than that of KN99a (Figure 1, top panel). We crossed these two strains and used microdissection to isolate 138 of the resulting spores, which were then cultured and stored as frozen stocks (see Methods and Supplemental Table 1, sheet B).
To identify sequence variants that would explain the virulence difference we observed between the KN99 and C8 parent strains, we took two distinct approaches: Bulked Segregant Analysis (BSA) and Association Analysis (AA). In BSA, which was first developed as a tool for plant genetic mapping 45, segregants from a single cross are divided based on a specific characteristic of interest and the resulting populations are subjected to molecular analysis. In our adaptation of this strategy, we collected populations of recombinants based on growth, either in rich laboratory medium or in the mouse lung after intranasal infection, and analyzed them by DNA sequencing. Our rationale was that alleles that are beneficial in either growth condition will increase in frequency, those that are deleterious will decrease, and those that are neutral will drift randomly. We postulated that alleles that are significantly enriched when cells are grown in the mouse lung are likely to be virulence-enhancing, while those that are significantly depleted are likely virulence-reducing. To identify such alleles, we randomly selected 100 of the C8 × KN99 progeny and combined them into five pools of 40 strains each, such that each strain was present in two distinct pools. Each pool was prepared in duplicate, with equal cell numbers of each strain. From these ten samples, aliquots were reserved as starting material; used to inoculate a culture of rich medium (YPD); and used to infect one mouse intranasally. We extracted DNA for sequencing from the starting material (inoculum), cells recovered from the YPD culture after 20 hours of growth at 30°C, and cells isolated from mouse lungs after 9 days of infection. To ensure that we could detect small changes in allele frequencies, all samples were sequenced to a minimum of 100-fold coverage by 2 × 150 bp, paired-end reads (Supplemental Table 1, sheet C).
Next, we evaluated each variant site in the genome for evidence of positive or negative selection by growth in YPD or mouse lung. To do this we compared each allele’s frequencies after growth to the frequency in the inoculum and calculated g’ statistics 46. We then summed the read counts across all pools from each condition for each allele and did the same frequency calculation. The top panel of Figure 3 shows the data for chromosome 2, with changes in allele frequency towards C8 arbitrarily assigned as positive and changes towards KN99 shown as negative. Results from the YPD samples (plotted in black) show little change in allele frequency from the starting material, with data points close to the x-axis all along the chromosome; this pattern was maintained throughout the genome (Supplemental Figures 1–3). In contrast, we observed large changes in allele frequency along this chromosome for samples that were isolated from the lungs of infected animals (plotted in red).
Figure 3.

Bulked segregant analysis and association analysis of C. neoformans chromosome 2. Distinct methods implicate the same genomic region in virulence. Top panel, changes in allele frequency for experimental samples compared to initial pools, with positive and negative values arbitrarily assigned to changes in the direction of C8 and KN99, respectively. Values are plotted for 5-kb windows of Chromosome 2, smoothed as described in the Methods. Symbols, mean value for all pools; vertical lines, range of individual pool values; black, YPD samples; red, mouse lung samples; yellow shaded region, IR-1 (see text). Larger symbols and darker lines indicate regions of statistical significance, as defined in the text. Results for the rest of the genome are in Supplemental Figures 1–3. Bottom panel, association analysis results for all variants in Chromosome 2. The red line shows the t-statistic comparing the virulence (measured as log2 fold-change of CFU compared to KN99) of strains with each parental allele at each variable site along the chromosome. Dashed lines, P-value thresholds for 0.001 (top line) and 0.01 (bottom line). P-value calculations are detailed in the Methods.
We examined the BSA data for regions of the genome where changes in allele frequency were (a) statistically significant (false discovery rate < 0.05) for lung samples but not YPD samples and (b) varied in the same direction for the mean of the pools and for each individual pool. One such region, which shows close agreement between individual pools and had the highest change in allele frequency, occurs on chromosome 2 between positions 283,000 and 467,000 (highlighted in yellow on Figure 3, top panel). We termed this Implicated Region 1 (IR-1). It shows an increase in C8 allele frequencies in lung samples (red symbols), suggesting that one or more alleles in this region confers a growth advantage in this host environment. Changes in IR-1 allele frequencies for the pools grown in YPD (black symbols) were not significant, suggesting that the alleles in this region are neither beneficial nor deleterious for growth in rich medium.
We next used a completely different assay and statistical methodology to examine the same group of 100 progeny strains for variants important for virulence, by asking whether there was a statistical association between the presence of specific alleles and virulence in mice when strains were tested individually. To perform this association analysis (AA), we first tested each recombinant strain alone in mice (Figure 1, bottom panel), using lung burden (log2 fold-change in CFU compared to KN99) as a surrogate for virulence. Notably, the range of virulence of the recombinants greatly exceeds the range defined by the parental strains, showing that each parent harbors alleles that are both advantageous and deleterious for this phenotype.
Next, we sequenced the individual recombinant strains and calculated the association between the presence of the C8 variant and virulence at each variant position. To compute P-values, we first computed the t-statistic comparing the group with the C8 allele at that site to the group with the KN99 allele. Rather than comparing this statistic to a theoretical null distribution, we compared it to an empirical null obtained by permuting virulence scores 10,000 times and recalculating the statistics on the permuted virulence levels. For each permutation, we used the most significant t-statistic in the entire genome for the null model; this adjusts for multiple hypothesis testing. Statistics and selected genome-wide P-value thresholds are shown in Figure 3, lower panel. The results from the AA closely mirrored those obtained with BSA, with IR-1 showing the highest statistical significance of any region of the genome. These results validated the BSA approach, which has significant advantages in terms of the number of animals required, the experimental effort, and the potential resolution (see Discussion).
Our results do not imply that all C8 alleles in IR-1 confer a virulence advantage: one or a few alleles in this region could be responsible for the effect, with others being implicated due to linkage (see Discussion). To test the impact of specific alleles, we genetically engineered C. neoformans to swap alleles between the two parent strains. To select a pilot locus for this approach, we examined the sequences within IR-1. For protein-coding genes, we looked for the presence of variants which would change one or more amino acids, as an indication of potential perturbation of protein structure or function (Supplemental Table 2). For genes where corresponding deletion mutants were available 4, we assessed these mutants in our mouse model, as an indication of the role that sequence plays in virulence (Supplemental Figure 4, top panel, and Supplemental Table 2). Based on these results, we opted to first test a gene identified as CKF44_03628 (VPS45) that contains two variants, one in the 5’-UTR and another that changes a serine at position 417 to tryptophan. Deletion of this gene resulted in significantly reduced lung burden (Supplemental Figure 4, top panel, and Supplemental Table 2), consistent with the literature 47. To test whether the small nucleotide variants (SNVs) in VPS45 contribute to the virulence difference between C8 and KN99a, we engineered reciprocal sequence swaps in the background of each parent strain. Engineering these changes required the introduction of a selectable marker; to control for any marker effect, we transformed each parental strain with both the original sequence and the new sequence (from the other parent). We confirmed all strains by whole genome sequencing (WGS; Supplemental Table 1, sheet D) and tested them in mice as above.
Notably, the insertion of a drug marker adjacent to VPS45, with no other sequence change, significantly reduced the lung burden of each parent strain (Figure 4, left, compare KN99 to KN99K and C8 to C8C; see Discussion). Introduction of opposite alleles induced additional changes in lung burden: swapping the C8 allele into the KN99 strain reduced virulence (compare KN99K to KN99C), while the reciprocal change, introducing the KN99 allele into the C8 strain, trended towards increased virulence (compare C8C to C8K). We followed up on the 9-day lung burden results with a more sensitive assay of virulence, long term survival. In this experiment the KN99 background strains (Figure 4, right) showed significantly reduced virulence when the endogenous sequence at this locus was changed to that of C8 (orange), compared to a matched control swap (blue) where the sequence was not altered. All of these results strongly associate the KN99 allele at this position with increased virulence. However, this was the opposite of what we had anticipated for a variant within IR-1, a region that our BSA and AA studies indicated was associated with increased virulence of the C8 allele. To explain this, we speculated that VPS45, which occurs at the right edge of IR-1, was included in this region because of linkage to one or more C8 alleles with virulence effects large enough to outweigh the KN99 allele advantage at this locus. To test this hypothesis, we looked for a way to refine IR-1.
Figure 4.

Virulence studies of VPS45 swap strains. Shown are data from intranasal infections of C57BL/6J mice using 12,500 cryptococcal cells of the indicated strain. For this and all subsequent figures, strain names with a superscript have a drug marker inserted near the locus of interest; the base name is the background strain and the superscript indicates the source of the sequence that was swapped into that background (blue, KN99; orange, C8). Left panel, total lung burden 9 days after infection with the indicated strain. Each symbol represents an individual mouse; mean and standard deviation are shown. Comparison was by one-way ANOVA with Tukey’s multiple comparisons test post hoc. **, p < 0.01. Right panel, survival of groups of 8 mice over time after infection, with sacrifice triggered by weight below 80% of peak or signs of disease (see Methods). Survival curve comparison was by Log-rank test with p = 0.0001 for the comparison of KN99K to KN99C.
We had identified IR-1, which is almost 180 kb long and contains 65 genes and 161 variants, by analyzing 100 recombinants from a cross of C8 and KN99. The resolution of such analysis depends on the number of crossovers that occur in the recombinant genomes during meiosis. To refine the boundaries of IR-1, therefore, we needed to increase the number of crossovers in that region. To do this, we attempted to force recombination within IR-1, by performing the same cross with parent strains modified by the insertion of drug resistance cassettes at the ends of IR-1: NAT 48 at the 5’ end in C8 and G418 49 at the 3’ end in KN99a. Selection for progeny able to grow in the presence of both compounds yielded 221 strains (Supplemental Table 1, sheet B), although interestingly, roughly 30% of them showed aneuploidy of chromosome 2 (see Discussion). We used this strain set in a new BSA experiment of the same design as above, although with 87–88 strains per pool, and applied the same criteria we used earlier to identify IR-1 to the results (profiles shown in Supplemental Figures 5–7). This analysis identified a more limited region of chromosome 2, wholly within IR-1, as implicated in virulence. We termed this region IR-1.1 (Figure 5, top). IR-1.1 spans 57,716 bp (chr2: 297,636 – 355,352) and contains 18 genes and 56 variants (Figure 5, bottom). Notably, it excludes VPS45, supporting our conjecture that this locus was implicated in C8 virulence by linkage, rather than because of its inherent properties.
FIGURE 5.

Refinement of IR-1. Top panel, BSA analysis of 221 doubly drug-resistant progeny strains from the cross described in the text, performed and presented as in Figure 3. Results for the rest of the genome are in Supplemental Figures 5–7. Bottom panel, BSA results for each individual variant in IR-1 (Supplemental Table 3). Black dots, mean change in allele frequency across all pools; colored arrows, genes; numbers, numerical portions of gene identifiers for the six sequences tested by allele swap experiments.
We next focused our attention on the 18 genes in IR-1.1 (Figure 5, bottom). Based on the effects of variants in this region on protein sequence and the virulence of deletion strains (Supplemental Table 2), we generated reciprocal swap strains for five additional genes: CKF44_03608, CKF44_03619, CKF44_07469, CKF44_07964, and CKF44_07470 (Supplemental Table 1, sheet D). For genes with more than one variant, we chose transformants where all had been swapped for testing. For the first four of these genes, we observed no differences in virulence between marked parental strains bearing either the original or swapped alleles (Supplemental Figure 8), indicating that none of their variants were responsible for the virulence advantage conferred by C8 sequences in IR-1.1. For CKF44_07470, however, we observed increased lung burden at 15 days post-infection in mice infected with marked strains that contained the C8 allele compared to marked strains that contained the KN99 allele (Figure 6, left). Survival studies were consistent with these burden results, showing that KN99 strains with the C8 allele caused more rapid decline of mice than matched strains with KN99 sequence at this locus (Figure 6, right). We saw the same pattern with strains in the C8 background, with higher burden and shorter survival when the C8 allele was present (Supplemental Figures 9 and 10).
Figure 6.

Demonstration that the C8 allele of CKF44_07470 increases the virulence of KN99. Strain designations are as above: the base name indicates the background strain and the superscript indicates the allele that was swapped into that background. Left, mean ± SD of lung burden 15 days after mouse infection; each symbol represents one mouse. ****, P ≤ 0.0001 by t-test comparing KN99K and KN99C. Brain burdens are shown in Supplemental Figure 9. Right, mouse survival over time, analyzed by Log-rank test.
Our swap studies strongly suggested that one or more C8 variants in the sequence of CKF44_07470 was responsible for conferring a survival advantage on fungal cells in the context of an infected host. This gene, named PDE2 because it encodes the phosphodiesterase Pde2 50, has three differences between KN99 and C8 (Figure 7, panel A): a 4-nucleotide insertion in C8 in the second intron and two differences in coding regions near the 3’ end of the gene (one synonymous and one missense variant in exon 7). To determine which of these was responsible for the virulence changes we had observed, we separated the variants at each end of the gene by engineering a strain in the KN99 background where the only sequence change in PDE2 was incorporation of the intron variant (KN99CKK). To our surprise, the presence of this variant alone was sufficient to reproduce the increase in organ burden we had previously observed when all variants were swapped (Figure 7B).
Figure 7.

Demonstration that the intron variant of C8 is sufficient to increase virulence. A, variants in C8 relative to KN99. B, mean ± SD of total lung burden at 15 days (each symbol represents one mouse). Strain names are as above except that the three letter superscripts represent the three SNVs of CKF44_07470 from 5’ to 3’. Comparison was by unpaired t-test: ns, not significant; ****, p ≤ 0.0001. C, read depth from RNA-seq experiments for the chromosome 2 region that contains CKF44_07470 (PDE2). Each color represents an independent replicate study using KN99; results for C8 were similar. Below the expression profiles is the current annotation for this region (white arrows, exons; gray segments, intron), with features that differ in C8 versus KN99 sequence indicated in blue. The sequence annotated as the second intron (boxed in blue) is present in mRNA at levels comparable to those of flanking exons, in contrast to all other introns, which show minimal reads. Blue triangle, site of marker insertion. D, anti-FLAG immunoblot of protein lysates from the indicated strains engineered to express N-terminally tagged Pde2. Migration position of standards is shown at the right in kDa. Expected protein sequences are in Supplemental Figure 11.
We wondered whether the 4-bp insertion in intron 2 of C8 might influence virulence by altering the splicing or expression of PDE2 mRNA. To address this, we performed RNA-seq analysis on each parent strain grown in host-like conditions in vitro (Supplemental Table 1, sheet E). Unexpectedly, we observed robust retention of intron 2 in mRNA from KN99 and C8, with transcript levels like those of the neighboring exons and no indication that it is spliced out during mRNA maturation (Figure 7C); results for H99 were similar. Based on these studies, we concluded that the reference annotation of PDE2 in H99 and KN99 is incorrect – the region annotated as intron 2 is not in fact an intron. This region in C8 (including the extra 4 bp) is a multiple of three in length, so its presence in the mRNA does not alter the downstream protein reading frame relative to the current annotation, although the protein gains 70 amino acids corresponding to translation of the intron 2 sequence (Supplemental Figure 11). Notably, all 240 non-laboratory sequences that we have analyzed (193 clinical, 4 veterinary, and 43 environmental; see 20 and Supplemental Table 1, sheet A) include this 4-bp sequence. In contrast, the lack of these 4 bp in KN99 shifts the reading frame, so that the encoded protein terminates after 461 amino acids, compared to 1,223 for C8 (Supplemental Figure 11). This truncation, which is well before the predicted active site, would further explain why the lung burden in mice infected with pde2Δ is the same as that in mice infected with the parent KN99 (Supplemental Figure 4, lower panel): neither strain produces active Pde2.
To directly examine the protein products, we N-terminally FLAG-tagged Pde2 in identical drug-marked KN99 strains in which PDE2 had either no C8 variants, all 3 C8 variants, or the C8 intron 2 insertion variant alone. As expected, we observed full-length protein in both strains with the C8 intron variant (Fig. 7D, orange triangle). The strain with all three KN99 alleles had no such signal, although a faint band was visible at the expected size of the truncated protein (Fig. 7D, blue triangle; see Discussion). These observations were further supported by mass spectrometry: no peptides downstream of the intron insertion variant were detected in KN99, although they were in C8 (Supplemental Figure 11).
Our results suggest that the presence of active Pde2 in C8 has a significant effect on virulence. Since this protein acts to cleave cAMP, we compared the cAMP levels of the two parents. Indeed, cAMP was significantly higher in KN99 than in C8, consistent with the lack of an active phosphodiesterase (11.8 ± 1.05 versus 4.19 ± 0.33 pg/106 cells; P <0.0001 by t-test).
DISCUSSION
We set out to discover, in an unbiased way, naturally occurring sequence variants in the C. neoformans genome that influence virulence. Our goal is to identify new sequences of interest, which can become a focus of direct research attention. By working at nucleotide resolution, we can gain mechanistic understanding of how virulence is influenced by specific sequence changes, whether they occur in novel or previously characterized genes. Our strategy further has the potential to identify key variants located in regulatory elements or essential genes, unlike studies that rely on gene deletion.
We used two approaches to discover sequence variants of interest, applying both to a population of recombinant progeny derived from a cross between a clinical strain (C8) and a laboratory strain (KN99). One approach was bulked segregant analysis (BSA), for which we compared genome sequence of recombinant pools either grown non-selectively in vitro or recovered from mouse lungs after intranasal inoculation. The second was to analyze the association between sequence and virulence, with the latter measured by fungal lung burden in the same animal model. The two methods showed excellent agreement in identifying a region of chromosome 2 where the C8 sequence favored higher lung burden compared to the same region of KN99. We then used sequence swap experiments to identify a gene within this region that increased the virulence of C8, and, ultimately, to narrow the key region to a single 4-nucleotide difference.
The power of our initial analysis came from combining two completely distinct methods. Association analysis was based on infecting mice with individual C. neoformans strains, with a readout of lung burden determined by plating lung homogenates. In contrast, BSA was based on infection with pooled strains, with a readout of allele frequency determined by whole genome sequencing. BSA has considerable methodological advantages. First, it uses fewer mice (up to two orders of magnitude), which is of both ethical and practical significance. Second, due to the efficiency of pooling strains and the capacity of genome sequencing, BSA requires less experimental work, even beyond that related to the animal studies. For these reasons, once we established the robust agreement between the two methods, we moved to BSA analysis alone.
Drug resistance markers are a convenient tool for strain engineering in C. neoformans 48,49,51. However, insertion adjacent to genes of interest may reduce baseline strain virulence 52, as occurred when we inserted a marker upstream of VPS45 (Figure 4). We hypothesize that this is because of interference with unrecognized regulatory sequences, a conjecture supported by our RNA-seq data (Supplemental Table 1, sheet E). We also used marker cassettes at each end of IR-1 in the parental strains to force recombination within this region. While this strategy was successful, our analysis suggests two cautionary notes. First, in addition to enhanced recombination between marker sites, these studies yielded a high level of aneuploidy of this chromosome in cross progeny (30%), likely due to the pressure imposed by double drug selection. (Our initial cross progeny, isolated without drug selection, were only 3% aneuploid.) In principle, the frequency of these strains in the starting or final pools should not affect the difference in frequency between alleles, since each aneuploid strain contains both alleles; aneuploidy may thus reduce statistical power but should not lead to P-value inflation. Our analysis of the same strain set with and without the aneuploid progeny indeed confirmed that their presence did not alter our results. However, in applications where the presence of aneuploids is a concern, strains generated by ‘forced’ recombination in this manner should be subjected to whole genome sequencing and analysis of copy number variation. Second, a recombination hotspot between our two sites of marker insertion yielded multiple recombinants with the same breakpoint. For future studies, it would be advantageous to avoid such insertion sites, to obtain more evenly distributed recombination events.
Our studies of C. neoformans recombinants show that each parental strain harbors multiple significant variants that influence virulence in both directions. For crosses of C8 and KN99, the genomic region with the largest change in allele frequency during infection happened to be one in which the C8 sequence favored higher lung burden, even though the C8 strain overall is less virulent than KN99. When we investigated individual sequences within this region, we found genes where the C8 allele favored virulence (e.g. PDE2), the KN99 allele favored virulence (e.g. VPS45), or neither favored virulence (e.g. CKF44_03619). All these patterns can occur within the same identified region because the sequences are genetically linked. Our refinement of IR-1 yielded more concordant patterns, with the exclusion of VPS45 from the region where C8 favored virulence. In theory, using enough recombinants would allow resolution of individual genes; in practice, experimental factors will dictate the balance between the effort expended to generate and analyze recombinants and the effort required to dissect an identified region of interest through allele swapping. Such factors may include the mating efficiency of specific strains and the efficiency of cryptococcal genome engineering, which has recently advanced through use of CRISPR 53,54.
The gene we identified in IR-1 for which the KN99 allele favored virulence, VPS45, is robustly expressed in human CSF 55,56 and in multiple in vitro conditions relevant to virulence 57. This gene encodes a homolog of the S. cerevisiae Vps45, which acts in the regulation of vesicular transport 58. Cells completely lacking Vps45 were recently characterized in C. neoformans and shown to be impaired in iron uptake, mitochondrial function, and surface properties that are key factors in virulence 47. Our virulence studies suggest that the C8 variants in this sequence compromise Vps45 activity, either by altering gene expression or changing protein structure. Future examination of these swap strains could potentially define this mechanistically.
We were surprised that our virulence-based analysis yielded PDE2, since this gene had been reported to play a minimal role in cryptococcal expression of virulence factors 50 and a deletion strain from the Madhani collection 4 showed no altered virulence in our animal model. This mystery was solved by our discovery that the PDE2 transcript in the laboratory strains used for these studies encodes only a truncated form of the protein, which lacks the phosphodiesterase domain, so deletion of the gene would not be expected to alter phenotype. (This truncated protein is also low abundance, suggesting it is degraded.) Consistent with the absence of active Pde2, the level of cAMP in KN99 is significantly higher than that of the clinical strain C8. Nonetheless, we cannot rule out the possibility that Pde2 plays additional cellular roles, independent of cAMP.
Although the cryptococcal literature suggests that higher cAMP generally favors the development of virulence factors, these effects are highly pleiotropic 59 and cAMP levels are subject to complex regulation through multiple pathways 60,61. Compensatory changes in KN99 may also mitigate the effects of Pde2’s absence, potentially including feedback mechanisms, as suggested by the increased transcription of PDE2 in KN99 compared to C8 under host-like conditions (Supplemental Table 1, sheet E) even though this mRNA does not encode a functional phosphodiesterase. Overall, the common association of higher cAMP levels with virulence may need to be refined.
We suspect that the defective KN99 allele appeared near the time of isolation of its progenitor strain H99 41, because it occurs in multiple lineages originating in this isolate 16,34 but not in any of the 240 clinical or environmental isolate sequences that we examined. (The original H99 patient isolate is not available for sequencing 34.) This is the second virulence-altering genomic change that has now been associated with common laboratory strains of C. neoformans; prior studies showed that one branch of the H99 lineage, which includes the most prevalent model strains, has increased virulence due to partial deletion of SGF29 34,62. These differences must be kept in mind as future genome-wide projects are pursued in this organism; while well-developed reference strains and their derivatives are a tremendous tool for research, they do not always accurately represent clinical isolates 63.
Our strategy, and the libraries of recombinants we have collected, are powerful tools for the unbiased analysis of cryptococcal traits of biological and medical interest at the sequence level. Any characteristic of interest that is measurable and varies within the recombinant population is amenable to this analysis, even if it is fairly similar in the original parents. Such studies, in our lab and others, will identify new targets for investigation and lead to increased mechanistic understanding of an important fungal pathogen of humans.
STAR METHODS
RESOURCE AVAILABILITY
Lead contact
Requests for resources or information should be directed to the lead contact, Tamara Doering (doering@wustl.edu)
Materials availability
All engineered strains generated in this study are available on request from the lead contact.
Data and code availability
Whole genome sequence data have been deposited in the Sequence Read Archive (SRA) as Project number PRJNA967729 and are publicly available as of the date of publication; accession numbers are listed in Supplemental Table 1 as noted in the key resources table. RNA-seq data have been deposited at Gene Expression Omnibus (GEO) as project number GSE232437 and are publicly available as of the date of publication. Accession numbers are listed in Supplemental Table 1 as noted in the key resources table. Proteomics data is available through PRIDE: Proteome Xchange under accession number PXD045163 and will be made available to the public once the paper is published with a DOI.
Key resources table.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Antibodies | ||
| Monoclonal ANTI-FLAG® M2 antibody | Sigma-Aldrich | Cat# F3165; RRID: AB_259529 |
| IRDye® 680RD Goat anti-Mouse IgG Secondary Antibody | LI-COR Biosciences | Cat# 926-68070; RRID: AB_10956588 |
| Bacterial and virus strains | ||
| Cryptococcus neoformans: KN99a and KN99α (38) | Joe Heitman, Duke University | [] |
| Cryptococcus neoformans: clinical and environmental isolates | John Perfect, Duke University | This paper, Supplemental Table 1A |
| Cryptococcus neoformans: clinical isolates | CINCH Consortium | This paper, Supplemental Table 1A |
| Cryptococcus neoformans: gene deletion library (4) | Fungal Genetics Stock Center | Madhani plates (2015, 2016, 2020) |
| Cryptococus neoformans: engineered strains | Authors | This paper, Supplemental Table 1B and 1D |
| Cryptococcus neoformans: recombinant strains | Authors | This paper, Supplemental Table 1B |
| Chemicals, peptides, and recombinant proteins | ||
| Yeast extract | Fisher Scientific | Cat# DF0127179 |
| Bacto peptone | ThermoFisher | Cat# 211677 |
| Phosphate buffered saline | Fisher Scientific | Cat# MT21040CM |
| V8 juice | Campbell Soup Company | |
| RPMI medium | Sigma-Aldrich | Cat# R8758-6X500ML |
| Mouse serum | BioIVT | Cat# MSE01SRMUN5, lot # MSE308112 |
| 13C-5-adenosine cAMP | Toronto Research Chemicals | Cat # A280457 |
| Critical commercial assays | ||
| TRIzol reagent | Ambion | Cat# 15-596-018 |
| NEBNext Ultra RNA Library Prep Kit for Illumina | New England Biolabs | Cat# E7420L |
| NEBNext Poly(A) mRNA Magnetic Isolation Module | New England Biolabs | Cat# E7490L |
| NEBNext Ultra II DNA Library Prep Kit for Illumina | New England Biolabs | Cat# E7645L |
| Deposited data | ||
| Whole genome sequence data | Sequence Read Archive project number PRJNA967729 | Accession numbers listed in Supplementary Table 1A–D |
| RNA-seq data | Gene Expression Omnibus (GEO) project number GSE232437 | Accession numbers listed in Supplementary Table 1E |
| Proteomics data | PRIDE: Proteome Xchange | Accession number: PXD045163 |
| Experimental models: Organisms/strains | ||
| Mice: C57Bl/6 | Jackson Laboratory | Cat#000664 RRID:IMSR_JAX:000664 |
| Software and algorithms | ||
| GraphPad Prism 9 | GraphPad Software, Inc. | |
| NextGenMap | Sedlazeck et al. (64) | https://cibiv.github.io/NextGenMap/ |
| SAMtools | Li et al. (65) | http://www.htslib.org/ |
| Lumpy | Layer et al. (66) | https://github.com/arq(5)x/lumpy-sv |
| Yaha | Faust et al. (67) | https://github.com/GregoryFaust/yaha |
| MergeSamFiles from Picard tools version 2.10.0 | Broad Institute | https://github.com/broadinstitute/picard |
| FreeBayes version 1.1.0 | Garrison et al. (68) | https://github.com/freebayes/freebayes |
| SNPEff version 4.3.1 | Cingolani et al. (69) | https://pcingola.github.io/SnpEff/ |
| CNVnator 0.3.2 | Abyzov et al. (70) | https://github.com/abyzovlab/CNVnator |
| QTLseqr | Mansfield et al. (71) | https://github.com/bmansfeld/QTLseqr |
| FastQC | Wingett et al. (75) | https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ |
| Hisat2 version 2.2.1 | Kim et al. (76) | http://daehwankimlab.github.io/hisat2/ |
| Subread 2.0.0 | Liao et al. (77) | https://rnnh.github.io/bioinfo-notebook/docs/featureCounts.html |
| DESeq2 | Love et al. (78) | https://bioconductor.org/packages/release/bioc/html/DESeq2.html |
| Bulk Segregant Analysis (BSA) and Association Analysis (AA) | This paper | https://github.com/DanielPAagustinho/BSA_crypto_analysis |
| IHW (independent hypothesis weighting) package | Ignatiadis et al. (79) | https://github.com/nignatiadis/IHW |
| MaxQuant software (version 2.0.3) | Cox et al. (82) | https://maxquant.net |
All original code has been deposited at Github and is publicly available as of the date of publication. DOI information is listed in the key resources table.
Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.
EXPERIMENTAL MODEL AND STUDY PARTICIPANT DETAILS
Cryptococcus neoformans: strains and growth conditions
The strains used in this study are listed in Table S1. C. neoformans strains were cultured in yeast peptone dextrose (YPD, Difco) at 30 °C with shaking (230 rpm) unless indicated otherwise. For colony forming unit assessment, strains were plated on YPD agar and incubated overnight at 30 °C. Where appropriate, 0.1 mg/ml of nourseothricin sulfate (G Biosciences) or G418 (Gibco) was included in the growth medium. For RNA-seq experiments, cells were grown in RPMI + 10% mouse serum at 37°C, 5% CO2. For mating, colonies were mixed on V8 medium pH 5.0 (per liter: 50 ml V8 juice, 25.7 ml 0.2M Na2HPO4, 24.3 ml 0.1 M citric acid, 40 g agar, and 2 ml 25 mM CuSO4 (added after autoclaving)) and kept at room temperature in the dark for 2 weeks. For cAMP analysis, cells were grown in RPMI with 2% mouse serum as described in the detailed methods.
Mice: ethics statement, strain, and care
All mouse experiments were performed in accordance with NIH guidelines, the Animal Welfare Act, and US federal law and were approved by the Washington University Institutional Animal Care and Use Committee (IACUC) under protocol 23–0058.
Six-week-old female C57BL/6J mice were obtained from Jackson Laboratory. Mice were housed in a barrier facility with food and water provided ad libitum. Animal care was overseen by the Washington University Division of Comparative Medicine (DCM). Although there is no reported data to suggest that C. neoformans infection results in our animal model differ between female and male mice, we recognize that using only female mice may limit generalizability of this study.
METHOD DETAILS
Strains and growth
KN99 strains were obtained from Joe Heitman (Duke University) and clinical strains from John Perfect (Duke University) and from the CINCH consortium; see Supplemental Table 1 for details. C. neoformans deletion strains generated by the Madhani group 4 were obtained from the ATCC. For experiments, strains were streaked from storage at −80 °C onto YPD plates, incubated for two days at 30 °C, inoculated from single colonies into YPD liquid cultures, and grown at 30 °C with shaking (220 rpm) unless indicated otherwise.
Virulence studies
For all infections, overnight cultures of cells grown as above were washed three times in PBS, adjusted to 2.5 × 105 cells/ml, and used for intranasal inoculation of 6-week-old female C57/Bl6 mice (see Figure legends for specific inocula) and for plating to confirm viable cells in the inoculum. To measure organ burden, mice were sacrificed at the times indicated in the text and organs were harvested, homogenized, and plated (YPD agar, 30 °C, 2 days) for enumeration of colony forming units. To assess survival after infection, mice were weighed daily and sacrificed if their weight reached 80% of initial weight, if they showed signs of illness, or at the end of the study. Statistical differences in organ burden and survival were assessed by one-way ANOVA with Tukey’s multiple comparisons test post hoc and Log-rank (Mantel-Cox) test, respectively, using GraphPad Prism9.
Crosses and spore isolation
To assess filamentation, single colonies of strains to be tested were mixed with single colonies of either KN99a or KN99α on V8 medium pH 5.0 (per liter: 50 ml V8 juice, 25.7 ml 0.2M Na2HPO4, 24.3 ml 0.1 M citric acid, 40 g agar, and 2 ml 25 mM CuSO4 (added after autoclaving)) and incubated for 14 days at room temperature in the dark before examination on a dissecting microscope. For individual strains that filamented poorly under these conditions, additional test crosses were performed on V8 plates adjusted to pH 7.0 by using 0.5 g KH2PO4 in place of the Na2HPO4 and citric acid solutions. For drug selection of recombinant progeny, cells were similarly crossed, and progeny plated on double drug plates. Colonies were then passaged three times on drug plates, once on YPD, and frozen in YPD. For spore microdissection, cells of each parent were grown as above, washed twice in PBS, diluted to OD600 of 1, and spotted on V8 plates (medium made as above but passed over a 70 μm pore strainer before plating).
Genome sequencing and analysis
For gDNA isolation, cells were either grown overnight in YPD as above or recovered from BSA studies as below. DNA was then isolated and sequenced as in 40, except that sonication was to an average size of 300 bp and sequencing was on an Illumina HiSeq-2500 (for paired end 150-bp reads). Reads were aligned to the KN99α ASM221672 reference genome 40 using NextGenMap 64 with the -X 100000000 parameter. Output SAM files were converted to BAM and PCR duplicates were removed using the <monospace>view</monospace> and <monospace>rmdup</monospace> commands from SAMtools 1.7 65, respectively. Unmapped and soft clipped reads (with at least 20 nucleotides) were extracted using the split_unmapped_to_fasta.pl script from the Lumpy package 66 with the -b 20 parameter and realigned using the split-read aligner Yaha 67 with the -M 15 -H 2000 -L 11 parameters. The outputs of NextGenMap and Yaha were merged using the script MergeSamFiles from Picard tools version 2.10.0 (https://github.com/broadinstitute/picard) and indexed with the <monospace>index</monospace> function from SAMtools.
Variant calling
SNVs and small indels were identified using FreeBayes version 1.1.0 68 with the parameters -F 0.75 -! 5 -p 1 -m 30. Variant annotation of the resulting VCF files was performed with SNPEff version 4.3.1 69. Copy-number variants (CNVs) were called with CNVnator 0.3.2 70 adjusted to use non-overlapping 100-bp windows and using default parameters. As in 40, each CNV was assigned a mean depth of coverage relative to the genome-wide depth of coverage and considered to be a duplication if the normalized depth was at least 1.9 times the genome-wide average for the same strain or a deletion if the normalized depth was below 0.25 times the genome-wide average.
Bulked segregant analysis (BSA)
For BSA, strains for analysis were streaked from frozen stocks as above, inoculated into 600 μl of YPD in 96-well deep-well plates, covered with Breathe-Easy film, and grown overnight (30 °C, 500 rpm). Equal volumes from each well were combined as specified below, in biological duplicate, and aliquots of the resulting pools were used for three purposes as follows: (1) immediately reserved to represent the initial pool; (2) grown in YPD (5 × 104 cells in 4 ml, 30 °C, 220 rpm, 20 h); (3) used to infect mice as above. After 9 or 15 days, lungs were harvested and homogenized in 3 ml of DNase I buffer (10 mM Tris-HCl, 2.5 mM MgCl2, 0.g mM CaCl2, pH 7.5) containing 2 mg/ml DNAse I (Thermo Scientific) and the homogenate was filtered to remove host tissue fragments using a cell strainer with 40 μm pores. The filtrate was subjected to centrifugation (3000×g, 7 min, RT) and the pellet was resuspended and incubated for 5 minutes in 2 ml of 1% SDS to lyse remaining host cells. The suspension was then diluted 10-fold in distilled water, centrifugation and SDS incubation were repeated, and the fungal cells were washed twice with distilled water.
DNA was prepared from all samples and sequenced using Illumina technology as above (average coverage 100.2-fold). Reads from replicate pairs were combined and the number of reads matching each parent’s allele at each variable site was used as an estimate of allele frequency. Variants were filtered, and only those found in the original C8 parent sequence were kept. We next calculated the change in allele frequency at each variable site for growth in YPD or in mice relative to the initial pool sample. We also calculated a genome-wide significance P-value with the R package QTLseqr 71, applying the G’ method46 to each pool individually and to the combined reads from all pools. Data was plotted as changes in allele frequency, with smoothing as in reference 71 when indicated in the text. BSA #1 included 100 strains derived from a KN99a × C8 cross, randomly assorted into 5 pools of 40 strains each, with each strain present in 2 pools and each pair of pools sharing 10 strains. BSA #2 included 221 new strains derived from a KN99a-G418 × C8-NAT cross. These strains were divided in 5 pools containing 87–88 strains each, with each strain present in 2 different pools. A sixth pool included all of the strains together.
Association Analysis (AA)
Individual mice were infected with KN99 and sequenced recombinant strains, with 136 infections performed in three groups with KN99 controls in each. CFU were assessed at 9 days as above, using lung burden (log2 fold-change in CFU compared to KN99) as a surrogate for virulence. At each variant position in the genome, we compared the virulence of strains with either KN99 or C8 alleles by calculating the t-statistic. Null distributions for P-values were obtained by permuting the assignment of CFU phenotypes to strains 10,000 times, calculating the largest t value genome-wide, and using the distribution of those largest t-values. This method accounts for multiple hypothesis testing.
Strain engineering
For sequence swap experiments, we engineered strains using biolistic transformation and the split marker strategy described in 72. Selection was mediated by a nourseothricin (NAT) resistance marker 48 that was inserted adjacent to the gene of interest (GOI) at the end nearest the variants to be altered, avoiding putative promotor or terminator regions. One fragment used for transformation therefore included one end of the marker gene fused to flanking and coding region of the GOI by PCR; the other consisted of the rest of the marker gene (including a 271 bp overlap) and sequence farther away from the GOI. All candidate transformants were assessed by WGS as above to select strains that had undergone recombination to yield the desired change in variant. For protein tagging, we engineered strains in PDE2 swap strain backgrounds using short-homology directed CRISPR 53. Briefly, DNA encoding a G418 resistance cassette and N-terminal 3X-FLAG tagging sequence was targeted via 50-bp homologous regions to the NAT resistance cassette of each swap strain. Colonies were screened for successful replacement of the NAT cassette with this construct by patterns of drug resistance, PCR, and directed sequencing of the region. Details of specific strain constructions are available on request.
RNA seq and analysis
RNA-seq was performed as in Reuwsaat et al 73 with minor differences. RNA was isolated from parental or engineered strains grown for 24 hours in RPMI + 10% mouse serum (37°C, 5% CO2) and sequenced as previously described 74. Briefly, cDNA samples were sequenced using the Illumina Nextseq platform for paired-end 2 × 150 bp reads and read quality was evaluated by FastQC 75. Fastq files were aligned to the KN99α genome 40 using Hisat2 version 2.2.1 76 with the default parameters plus --max-intronlen 2200. SAM files were converted to bam, reads were sorted and indexed, and read duplicates were removed from the final bam files using SAMtools 1.7 65. The number of reads mapped per gene was calculated using featureCounts from the package Subread 2.0.0 77 and differential gene expression was analyzed with DESeq2 78, using the IHW (independent hypothesis weighting) package to calculate adjusted p-values 79.
Protein analysis
For immunoblotting, cells were grown for 18 h in YPD as above, washed, and broken by bead beating (four 1-min rounds, alternating with 1 min on ice). The crude lysates were resolved by SDS-PAGE, transferred to nitrocellulose, and probed with 8 μg/mL anti-FLAG antibody M2 (Sigma) followed by 0.1 μg/mL goat anti-mouse secondary antibody (Licor).
For proteomics, cells were similarly grown overnight, diluted to 0.6 OD600 in YPD, grown for 3 h, collected, and washed with PBS before sample preparation as previously described 80. Briefly, cells were mechanically lysed by probe sonication, treated with dithiothreitol and iodoacetamide, and precipitated in acetone overnight at −20°C. The isolated proteins were then quantified and digested with LysC-trypsin (Promega [protein/enzyme ratio, 50:1]). The resulting peptides were purified using C18 stop-and-go-extraction tips (StageTips) 81 prior to resuspension in buffer A (0.1% formic acid) and analysis on an Orbitrap Exploris 240 hybrid quadrupole-orbitrap mass spectrometer (Thermo Fisher Scientific) coupled to a Vanquish Neo UHPLC system (Thermo Fisher Scientific). Peptides were trapped and then separated on a 75 μm × 50 cm PepMap RSLC EASY-Spray column (Thermo Fisher Scientific). Separated peptides were electrosprayed into the mass spectrometer with a gradient of 4% to 45% buffer B (80% acetonitrile, 0.1% formic acid) over 2 h, followed by a wash with 100% buffer B with a 300-nL/min flow rate. The mass spectrometer was operated in data-dependent acquisition mode switching between one full scan and subsequent MS/MS scans of abundant peaks with a 2s cycle time. Full scans (m/z 375 to 1,500) and MS/MS scans were acquired in the Orbitrap mass analyzer with a resolution of 60,000 and 15,000 at 200 m/z, respectively. Output from the mass spectrometer was processed using MaxQuant software (version 2.0.3) 82 as previously described 80 and the results used to search a modified C. neoformans var. grubii serotype A (strain H99/ATCC 208821) proteome (7,429 sequences; 24 July 2023) acquired from Uniprot; the FASTA file included the Pde2 protein sequences of interest.
cAMP analysis
Strains were grown as above, washed in RPMI with 2% mouse serum that had been preconditioned at 37 °C and 5% CO2, resuspended in 20 ml of the same medium at 107 cells/ml, and grown in the same conditions for 24 h. Cells were then counted again and 1-ml aliquots were collected by sedimentation, flash frozen, and submitted to the Washington University School of Medicine Metabolomics Facility for liquid chromatography/mass spectrometry analysis of cAMP relative to a 13C-5-adenosine cAMP internal standard (Toronto Research Chemicals).
QUANTIFICATION AND STATISTICAL ANALYSIS
For survival studies, groups of 8 mice were tested in at least two independent experiments and analyzed by Log-rank test. For organ burden studies, results from groups of 3–8 mice (specified in the legend for each figure that presents such data) were presented as mean −/+ standard deviation and compared by one-way ANOVA with Tukey’s multiple comparisons test post hoc (Figure 4; Supplemental Figures 4 and 6) or by t-test (Figures 6 and 7; Supplemental Figures 2, 5, and 6). For these experiments statistical analysis was performed using GraphPad Prism 9 software. Figure 1 was prepared with the same software, but no statistical tests were applied because this one-time survey was not used to make conclusions about significance. For association analysis, statistical analysss was based on all 136 mice used and performed using custom software (code posted on Github; link in the Key Resources Table) based on the t statistic and a permutation-based null model. For bulk segregant analysis, the C8 and KN99 allele for each variable site were compared in a total of at least 10 mice across two independent experiments. Analysis was performed using QTLseqr software and the G’ statistic as detailed in the Methods.
Supplementary Material
Highlights.
Bulk segregant analysis revealed sequence variants that influence fungal virulence.
Targeted variant swaps between strains showed reciprocal shifts in pathogenicity.
Common lab strains have a defect in phosphodiesterase activity that reduces virulence.
ACKNOWLEDGEMENTS
We thank John Perfect, Andrej Spec, and the CINCH consortium for generously providing clinical strains. We appreciate the assistance of Chase Mateusiak with computational analysis, Alyssa Brunsmann with mouse studies, Abigail Kimball with computational method assessment, Arjun Sukumaran with mass spectrometry, and Elizabeth Nordmark, Gaby Altman, and Michael Lin with spore isolation. We thank Barak Cohen for suggesting forced recombination, members of the Doering lab for stimulating discussion, and Keeley Choy, Daphne Ko, and Liza Loza for comments on the manuscript.
INCLUSION AND DIVERSITY
One or more of the authors of this paper self-identifies as an underrepresented ethnic minority in their field of research or within their geographical location. One or more of the authors of this paper self-identifies as a gender minority in their field of research.
Footnotes
DECLARATION OF INTERESTS
The authors declare no competing interests.
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
REFERENCES
- 1.Rajasingham R, Govender NP, Jordan A, Loyse A, Shroufi A, Denning DW, Meya DB, Chiller TM, and Boulware DR (2022). The global burden of HIV-associated cryptococcal infection in adults in 2020: a modelling analysis. The Lancet. Infectious diseases 22, 1748–1755. 10.1016/S1473-3099(22)00499-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Loftus BJ, Fung E, Roncaglia P, Rowley D, Amedeo P, Bruno D, Vamathevan J, Miranda M, Anderson IJ, Fraser JA, et al. (2005). The genome of the basidiomycetous yeast and human pathogen Cryptococcus neoformans. Science 307, 1321–1324. 10.1126/science.1103773. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Haynes BC, Skowyra ML, Spencer SJ, Gish SR, Williams M, Held EP, Brent MR, and Doering TL (2011). Toward an integrated model of capsule regulation in Cryptococcus neoformans. PLoS Pathog 7, e1002411. 10.1371/journal.ppat.1002411. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Liu OW, Chun CD, Chow ED, Chen C, Madhani HD, and Noble SM (2008). Systematic genetic analysis of virulence in the human fungal pathogen Cryptococcus neoformans. Cell 135, 174–188. S0092-8674(08)01012-X [pii] 10.1016/j.cell.2008.07.046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Jung KW, Yang DH, Maeng S, Lee KT, So YS, Hong J, Choi J, Byun HJ, Kim H, Bang S, et al. (2015). Systematic functional profiling of transcription factor networks in Cryptococcus neoformans. Nature communications 6, 6757. 10.1038/ncomms7757. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Lee KT, So YS, Yang DH, Jung KW, Choi J, Lee DG, Kwon H, Jang J, Wang LL, Cha S, et al. (2016). Systematic functional analysis of kinases in the fungal pathogen Cryptococcus neoformans. Nature communications 7, 12766. 10.1038/ncomms12766. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Lin X, Huang JC, Mitchell TG, and Heitman J (2006). Virulence attributes and hyphal growth of C. neoformans are quantitative traits and the MATalpha allele enhances filamentation. PLoS Genet 2, e187. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Vogan AA, Khankhet J, Samarasinghe H, and Xu J (2016). Identification of QTLs Associated with Virulence Related Traits and Drug Resistance in Cryptococcus neoformans. G3 6, 2745–2759. 10.1534/g3.116.029595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Cuomo CA, Rhodes J, and Desjardins CA (2018). Advances in Cryptococcus genomics: insights into the evolution of pathogenesis. Mem Inst Oswaldo Cruz 113, e170473. 10.1590/0074-02760170473. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Litvintseva AP, Marra RE, Nielsen K, Heitman J, Vilgalys R, and Mitchell TG (2003). Evidence of sexual recombination among Cryptococcus neoformans serotype A isolates in sub-Saharan Africa. Eukaryot Cell 2, 1162–1168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Litvintseva AP, Thakur R, Vilgalys R, and Mitchell TG (2006). Multilocus sequence typing reveals three genetic subpopulations of Cryptococcus neoformans var. grubii (serotype A), including a unique population in Botswana. Genetics 172, 2223–2238. 10.1534/genetics.105.046672. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Litvintseva AP, Lin X, Templeton I, Heitman J, and Mitchell TG (2007). Many globally isolated AD hybrid strains of Cryptococcus neoformans originated in Africa. PLoS Pathog 3, e114. 10.1371/journal.ppat.0030114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Wiesner DL, Moskalenko O, Corcoran JM, McDonald T, Rolfes MA, Meya DB, Kajumbula H, Kambugu A, Bohjanen PR, Knight JF, et al. (2012). Cryptococcal genotype influences immunologic response and human clinical outcome after meningitis. MBio 3. 10.1128/mBio.00196-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Litvintseva AP, and Mitchell TG (2012). Population genetic analyses reveal the African origin and strain variation of Cryptococcus neoformans var. grubii. PLoS Pathog 8, e1002495. 10.1371/journal.ppat.1002495. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Ormerod KL, Morrow CA, Chow EW, Lee IR, Arras SD, Schirra HJ, Cox GM, Fries BC, and Fraser JA (2013). Comparative Genomics of Serial Isolates of Cryptococcus neoformans Reveals Gene Associated With Carbon Utilization and Virulence. G3 3, 675–686. 10.1534/g3.113.005660. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Janbon G, Ormerod KL, Paulet D, Byrnes EJ 3rd, Yadav V, Chatterjee G, Mullapudi N, Hon CC, Billmyre RB, Brunel F, et al. (2014). Analysis of the genome and transcriptome of Cryptococcus neoformans var. grubii reveals complex RNA expression and microevolution leading to virulence attenuation. PLoS Genet 10, e1004261. 10.1371/journal.pgen.1004261. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Farrer RA, Desjardins CA, Sakthikumar S, Gujja S, Saif S, Zeng Q, Chen Y, Voelz K, Heitman J, May RC, et al. (2015). Genome Evolution and Innovation across the Four Major Lineages of Cryptococcus gattii. MBio 6, e00868–00815. 10.1128/mBio.00868-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Beale MA, Sabiiti W, Robertson EJ, Fuentes-Cabrejo KM, O’Hanlon SJ, Jarvis JN, Loyse A, Meintjes G, Harrison TS, May RC, et al. (2015). Genotypic Diversity Is Associated with Clinical Outcome and Phenotype in Cryptococcal Meningitis across Southern Africa. PLoS Negl Trop Dis 9, e0003847. 10.1371/journal.pntd.0003847. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Chen Y, Farrer RA, Giamberardino C, Sakthikumar S, Jones A, Yang T, Tenor JL, Wagih O, Van Wyk M, Govender NP, et al. (2017). Microevolution of Serial Clinical Isolates of Cryptococcus neoformans var. grubii and C. gattii. mBio 8. 10.1128/mBio.00166-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Rhodes J, Desjardins CA, Sykes SM, Beale MA, Vanhove M, Sakthikumar S, Chen Y, Gujja S, Saif S, Chowdhary A, et al. (2017). Tracing Genetic Exchange and Biogeography of Cryptococcus neoformans var. grubii at the Global Population Level. Genetics 207, 327–346. 10.1534/genetics.117.203836. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ashton PM, Thanh LT, Trieu PH, Van Anh D, Trinh NM, Beardsley J, Kibengo F, Chierakul W, Dance DAB, Rattanavong S, et al. (2019). Three phylogenetic groups have driven the recent population expansion of Cryptococcus neoformans. Nature communications 10, 2035. 10.1038/s41467-019-10092-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Ma H, and May RC (2010). Mitochondria and the regulation of hypervirulence in the fatal fungal outbreak on Vancouver Island. Virulence 1, 197–201. 11053 [pii] 10.4161/viru.1.3.11053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Sabiiti W, Robertson E, Beale MA, Johnston SA, Brouwer AE, Loyse A, Jarvis JN, Gilbert AS, Fisher MC, Harrison TS, et al. (2014). Efficient phagocytosis and laccase activity affect the outcome of HIV-associated cryptococcosis. J Clin Invest 124, 2000–2008. 10.1172/JCI72950. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Fernandes KE, Brockway A, Haverkamp M, Cuomo CA, van Ogtrop F, Perfect JR, and Carter DA (2018). Phenotypic Variability Correlates with Clinical Outcome in Cryptococcus Isolates Obtained from Botswanan HIV/AIDS Patients. MBio 9. 10.1128/mBio.02016-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Altamirano S, Jackson KM, and Nielsen K (2020). The interplay of phenotype and genotype in Cryptococcus neoformans disease. Biosci Rep 40. 10.1042/BSR20190337. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Velez N, Vega-Vela N, Munoz M, Gomez P, Escandon P, Ramirez JD, Zaragoza O, Monteoliva Diaz L, and Parra-Giraldo CM (2022). Deciphering the Association among Pathogenicity, Production and Polymorphisms of Capsule/Melanin in Clinical Isolates of Cryptococcus neoformans var. grubii VNI. J Fungi (Basel) 8. 10.3390/jof8030245. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Mukaremera L, McDonald TR, Nielsen JN, Molenaar CJ, Akampurira A, Schutz C, Taseera K, Muzoora C, Meintjes G, Meya DB, et al. (2019). The Mouse Inhalation Model of Cryptococcus neoformans Infection Recapitulates Strain Virulence in Humans and Shows that Closely Related Strains Can Possess Differential Virulence. Infect Immun 87. 10.1128/IAI.00046-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Gerstein AC, Jackson KM, McDonald TR, Wang Y, Lueck BD, Bohjanen S, Smith KD, Akampurira A, Meya DB, Xue C, et al. (2019). Identification of Pathogen Genomic Differences That Impact Human Immune Response and Disease during Cryptococcus neoformans Infection. MBio 10. 10.1128/mBio.01440-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Day JN, Qihui S, Thanh LT, Trieu PH, Van AD, Thu NH, Chau TTH, Lan NPH, Chau NVV, Ashton PM, et al. (2017). Comparative genomics of Cryptococcus neoformans var. grubii associated with meningitis in HIV infected and uninfected patients in Vietnam. PLoS Negl Trop Dis 11, e0005628. 10.1371/journal.pntd.0005628. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Andrade-Silva LE, Ferreira-Paim K, Ferreira TB, Vilas-Boas A, Mora DJ, Manzato VM, Fonseca FM, Buosi K, Andrade-Silva J, Prudente BDS, et al. (2018). Genotypic analysis of clinical and environmental Cryptococcus neoformans isolates from Brazil reveals the presence of VNB isolates and a correlation with biological factors. PLoS One 13, e0193237. 10.1371/journal.pone.0193237. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Montoya MC, Magwene PM, and Perfect JR (2021). Associations between Cryptococcus Genotypes, Phenotypes, and Clinical Parameters of Human Disease: A Review. J Fungi (Basel) 7. 10.3390/jof7040260. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Kassaza K, Wasswa F, Nielsen K, and Bazira J (2022). Cryptococcus neoformans Genotypic Diversity and Disease Outcome among HIV Patients in Africa. J Fungi (Basel) 8. 10.3390/jof8070734. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Desjardins CA, Giamberardino C, Sykes SM, Yu CH, Tenor JL, Chen Y, Yang T, Jones AM, Sun S, Haverkamp MR, et al. (2017). Population genomics and the evolution of virulence in the fungal pathogen Cryptococcus neoformans. Genome Res 27, 1207–1219. 10.1101/gr.218727.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Arras SDM, Ormerod KL, Erpf PE, Espinosa MI, Carpenter AC, Blundell RD, Stowasser SR, Schulz BL, Tanurdzic M, and Fraser JA (2017). Convergent microevolution of Cryptococcus neoformans hypervirulence in the laboratory and the clinic. Scientific reports 7, 17918. 10.1038/s41598-017-18106-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Sephton-Clark P, Tenor JL, Toffaletti DL, Meyers N, Giamberardino C, Molloy SF, Palmucci JR, Chan A, Chikaonda T, Heyderman R, et al. (2022). Genomic Variation across a Clinical Cryptococcus Population Linked to Disease Outcome. mBio 13, e0262622. 10.1128/mbio.02626-22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Kwon-Chung KJ (1975). A new genus, filobasidiella, the perfect state of Cryptococcus neoformans. Mycologia 67, 1197–1200. [PubMed] [Google Scholar]
- 37.Hsueh Y-P, Lin X, Kwon-Chung J, and Heitman J (2011). Sexual reproduction of Cryptococcus. In Cryptococcus, from Human Pathogen to Model Yeast, Heitman J, Kozel TR, Kwon-Chung J, Perfect J, and Casadevall A, eds. (ASM Press; ), pp. 81–96. [Google Scholar]
- 38.Nielsen K, Cox GM, Wang P, Toffaletti DL, Perfect JR, and Heitman J (2003). Sexual cycle of Cryptococcus neoformans var. grubii and virulence of congenic a and alpha isolates. Infect Immun 71, 4831–4841. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Litvintseva AP, and Mitchell TG (2009). Most environmental isolates of Cryptococcus neoformans var. grubii (serotype A) are not lethal for mice. Infect Immun 77, 3188–3195. 10.1128/IAI.00296-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Friedman RZ, Gish SR, Brown H, Brier L, Howard N, Doering TL, and Brent MR (2018). Unintended Side Effects of Transformation Are Very Rare in Cryptococcus neoformans. G3 8, 815–822. 10.1534/g3.117.300357. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Perfect JR, Lang SD, and Durack DT (1980). Chronic cryptococcal meningitis: a new experimental model in rabbits. Am J Pathol 101, 177–194. [PMC free article] [PubMed] [Google Scholar]
- 42.Kwon-Chung KJ, and Bennett JE (1978). Distribution of alpha and alpha mating types of Cryptococcus neoformans among natural and clinical isolates. Am J Epidemiol 108, 337–340. 10.1093/oxfordjournals.aje.a112628. [DOI] [PubMed] [Google Scholar]
- 43.del Poeta M, Toffaletti DL, Rude TH, Sparks SD, Heitman J, and Perfect JR (1999). Cryptococcus neoformans differential gene expression detected in vitro and in vivo with green fluorescent protein. Infect Immun 67, 1812–1820. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Yadav V, Sun S, and Heitman J (2021). Uniparental nuclear inheritance following bisexual mating in fungi. Elife 10. 10.7554/eLife.66234. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Michelmore RW, Paran I, and Kesseli RV (1991). Identification of markers linked to disease-resistance genes by bulked segregant analysis: a rapid method to detect markers in specific genomic regions by using segregating populations. Proc Natl Acad Sci U S A 88, 9828–9832. 10.1073/pnas.88.21.9828. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Magwene PM, Willis JH, and Kelly JK (2011). The statistics of bulk segregant analysis using next generation sequencing. PLoS Comput Biol 7, e1002255. 10.1371/journal.pcbi.1002255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Caza M, Hu G, Nielson ED, Cho M, Jung WH, and Kronstad JW (2018). The Sec1/Munc18 (SM) protein Vps45 is involved in iron uptake, mitochondrial function and virulence in the pathogenic fungus Cryptococcus neoformans. PLoS Pathog 14, e1007220. 10.1371/journal.ppat.1007220. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.McDade HC, and Cox GM (2001). A new dominant selectable marker for use in Cryptococcus neoformans. Med Mycol 39, 151–154. [DOI] [PubMed] [Google Scholar]
- 49.Hua J, Meyer JD, and Lodge JK (2000). Development of positive selectable markers for the fungal pathogen Cryptococcus neoformans. Clin Diagn Lab Immunol 7, 125–128. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Hicks JK, Bahn YS, and Heitman J (2005). Pde1 phosphodiesterase modulates cyclic AMP levels through a protein kinase A-mediated negative feedback loop in Cryptococcus neoformans. Eukaryot Cell 4, 1971–1981. 4/12/1971 [pii] 10.1128/EC.4.12.1971-1981.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Cox GM, Toffaletti DL, and Perfect JR (1996). Dominant selection system for use in Cryptococcus neoformans. J Med Vet Mycol 34, 385–391. [PubMed] [Google Scholar]
- 52.Wang ZA, Griffith CL, Skowyra ML, Salinas N, Williams M, Maier EJ, Gish SR, Liu H, Brent MR, and Doering TL (2014). Cryptococcus neoformans dual GDP-mannose transporters and their role in biology and virulence. Eukaryot Cell 13, 832–842. 10.1128/EC.00054-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Huang MY, Joshi MB, Boucher MJ, Lee S, Loza LC, Gaylord EA, Doering TL, and Madhani HD (2022). Short homology-directed repair using optimized Cas9 in the pathogen Cryptococcus neoformans enables rapid gene deletion and tagging. Genetics 220. 10.1093/genetics/iyab180. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Fan Y, and Lin X (2018). Multiple Applications of a Transient CRISPR-Cas9 Coupled with Electroporation (TRACE) System in the Cryptococcus neoformans Species Complex. Genetics 208, 1357–1372. 10.1534/genetics.117.300656. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Chen Y, Toffaletti DL, Tenor JL, Litvintseva AP, Fang C, Mitchell TG, McDonald TR, Nielsen K, Boulware DR, Bicanic T, and Perfect JR (2014). The Cryptococcus neoformans transcriptome at the site of human meningitis. MBio 5, e01087–01013. 10.1128/mBio.01087-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Yu CH, Sephton-Clark P, Tenor JL, Toffaletti DL, Giamberardino C, Haverkamp M, Cuomo CA, and Perfect JR (2021). Gene Expression of Diverse Cryptococcus Isolates during Infection of the Human Central Nervous System. mBio 12, e0231321. 10.1128/mBio.02313-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Basenko EY, Pulman JA, Shanmugasundram A, Harb OS, Crouch K, Starns D, Warrenfeltz S, Aurrecoechea C, Stoeckert CJ Jr., Kissinger JC, et al. (2018). FungiDB: An Integrated Bioinformatic Resource for Fungi and Oomycetes. J Fungi (Basel) 4. 10.3390/jof4010039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Nichols BJ, and Pelham HR (1998). SNAREs and membrane fusion in the Golgi apparatus. Biochim Biophys Acta 1404, 9–31. 10.1016/s0167-4889(98)00044-5. [DOI] [PubMed] [Google Scholar]
- 59.Roth C, Murray D, Scott A, Fu C, Averette AF, Sun S, Heitman J, and Magwene PM (2021). Pleiotropy and epistasis within and between signaling pathways defines the genetic architecture of fungal virulence. PLoS Genet 17, e1009313. 10.1371/journal.pgen.1009313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Kwon-Chung KJ, Fraser JA, Doering TL, Wang Z, Janbon G, Idnurm A, and Bahn YS (2014). Cryptococcus neoformans and Cryptococcus gattii, the etiologic agents of cryptococcosis. Cold Spring Harbor perspectives in medicine 4, a019760. 10.1101/cshperspect.a019760. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Caza M, and Kronstad JW (2019). The cAMP/Protein Kinase a Pathway Regulates Virulence and Adaptation to Host Conditions in Cryptococcus neoformans. Frontiers in cellular and infection microbiology 9, 212. 10.3389/fcimb.2019.00212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Fernandes KE, Fraser JA, and Carter DA (2022). Lineages Derived from Cryptococcus neoformans Type Strain H99 Support a Link between the Capacity to Be Pleomorphic and Virulence. mBio 13, e0028322. 10.1128/mbio.00283-22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Jackson KM, Ding M, and Nielsen K (2023). Importance of Clinical Isolates in Cryptococcus neoformans Research. J Fungi (Basel) 9. 10.3390/jof9030364. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Sedlazeck FJ, Rescheneder P, and von Haeseler A (2013). NextGenMap: fast and accurate read mapping in highly polymorphic genomes. Bioinformatics 29, 2790–2791. 10.1093/bioinformatics/btt468. [DOI] [PubMed] [Google Scholar]
- 65.Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, and Genome Project Data Processing, S. (2009). The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079. 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Layer RM, Chiang C, Quinlan AR, and Hall IM (2014). LUMPY: a probabilistic framework for structural variant discovery. Genome Biol 15, R84. 10.1186/gb-2014-15-6-r84. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Faust GG, and Hall IM (2012). YAHA: fast and flexible long-read alignment with optimal breakpoint detection. Bioinformatics 28, 2417–2424. 10.1093/bioinformatics/bts456. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Garrison E, and Marth G (2012). Haplotype-based variant detection from short-read sequencing. arXiv 10.48550/arXiv.1207.3907. [DOI] [Google Scholar]
- 69.Cingolani P, Platts A, Wang le L, Coon M, Nguyen T, Wang L, Land SJ, Lu X, and Ruden DM (2012). A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 6, 80–92. 10.4161/fly.19695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Abyzov A, Urban AE, Snyder M, and Gerstein M (2011). CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res 21, 974–984. 10.1101/gr.114876.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Mansfeld BN, and Grumet R (2018). QTLseqr: An R Package for Bulk Segregant Analysis with Next-Generation Sequencing. Plant Genome 11. 10.3835/plantgenome2018.01.0006. [DOI] [PubMed] [Google Scholar]
- 72.Fu J, Hettler E, and Wickes BL (2006). Split marker transformation increases homologous integration frequency in Cryptococcus neoformans. Fungal Genet Biol 43, 200–212. S1087-1845(06)00023-5 [pii] 10.1016/j.fgb.2005.09.007. [DOI] [PubMed] [Google Scholar]
- 73.Reuwsaat JCV, Agustinho DP, Motta H, Chang AL, Brown H, Brent MR, Kmetzsch L, and Doering TL (2021). The Transcription Factor Pdr802 Regulates Titan Cell Formation and Pathogenicity of Cryptococcus neoformans. mBio 12. 10.1128/mBio.03457-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Maier EJ, Haynes BC, Gish SR, Wang ZA, Skowyra ML, Marulli AL, Doering TL, and Brent MR (2015). Model-driven mapping of transcriptional networks reveals the circuitry and dynamics of virulence regulation. Genome Res 25, 690–700. 10.1101/gr.184101.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Wingett SW, and Andrews S (2018). FastQ Screen: A tool for multi-genome mapping and quality control. F1000Res 7, 1338. 10.12688/f1000research.15931.2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Kim D, Paggi JM, Park C, Bennett C, and Salzberg SL (2019). Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat Biotechnol 37, 907–915. 10.1038/s41587-019-0201-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Liao Y, Smyth GK, and Shi W (2014). featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930. 10.1093/bioinformatics/btt656. [DOI] [PubMed] [Google Scholar]
- 78.Love MI, Huber W, and Anders S (2014). Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15, 550. 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Ignatiadis N, Klaus B, Zaugg JB, and Huber W (2016). Data-driven hypothesis weighting increases detection power in genome-scale multiple testing. Nat Methods 13, 577–580. 10.1038/nmeth.3885. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Ball B, and Geddes-McAlister J (2019). Quantitative Proteomic Profiling of Cryptococcus neoformans. Curr Protoc Microbiol 55, e94. 10.1002/cpmc.94. [DOI] [PubMed] [Google Scholar]
- 81.Rappsilber J, Mann M, and Ishihama Y (2007). Protocol for micro-purification, enrichment, pre-fractionation and storage of peptides for proteomics using StageTips. Nat Protoc 2, 1896–1906. 10.1038/nprot.2007.261. [DOI] [PubMed] [Google Scholar]
- 82.Cox J, and Mann M (2008). MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat Biotechnol 26, 1367–1372. 10.1038/nbt.1511. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Whole genome sequence data have been deposited in the Sequence Read Archive (SRA) as Project number PRJNA967729 and are publicly available as of the date of publication; accession numbers are listed in Supplemental Table 1 as noted in the key resources table. RNA-seq data have been deposited at Gene Expression Omnibus (GEO) as project number GSE232437 and are publicly available as of the date of publication. Accession numbers are listed in Supplemental Table 1 as noted in the key resources table. Proteomics data is available through PRIDE: Proteome Xchange under accession number PXD045163 and will be made available to the public once the paper is published with a DOI.
Key resources table.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Antibodies | ||
| Monoclonal ANTI-FLAG® M2 antibody | Sigma-Aldrich | Cat# F3165; RRID: AB_259529 |
| IRDye® 680RD Goat anti-Mouse IgG Secondary Antibody | LI-COR Biosciences | Cat# 926-68070; RRID: AB_10956588 |
| Bacterial and virus strains | ||
| Cryptococcus neoformans: KN99a and KN99α (38) | Joe Heitman, Duke University | [] |
| Cryptococcus neoformans: clinical and environmental isolates | John Perfect, Duke University | This paper, Supplemental Table 1A |
| Cryptococcus neoformans: clinical isolates | CINCH Consortium | This paper, Supplemental Table 1A |
| Cryptococcus neoformans: gene deletion library (4) | Fungal Genetics Stock Center | Madhani plates (2015, 2016, 2020) |
| Cryptococus neoformans: engineered strains | Authors | This paper, Supplemental Table 1B and 1D |
| Cryptococcus neoformans: recombinant strains | Authors | This paper, Supplemental Table 1B |
| Chemicals, peptides, and recombinant proteins | ||
| Yeast extract | Fisher Scientific | Cat# DF0127179 |
| Bacto peptone | ThermoFisher | Cat# 211677 |
| Phosphate buffered saline | Fisher Scientific | Cat# MT21040CM |
| V8 juice | Campbell Soup Company | |
| RPMI medium | Sigma-Aldrich | Cat# R8758-6X500ML |
| Mouse serum | BioIVT | Cat# MSE01SRMUN5, lot # MSE308112 |
| 13C-5-adenosine cAMP | Toronto Research Chemicals | Cat # A280457 |
| Critical commercial assays | ||
| TRIzol reagent | Ambion | Cat# 15-596-018 |
| NEBNext Ultra RNA Library Prep Kit for Illumina | New England Biolabs | Cat# E7420L |
| NEBNext Poly(A) mRNA Magnetic Isolation Module | New England Biolabs | Cat# E7490L |
| NEBNext Ultra II DNA Library Prep Kit for Illumina | New England Biolabs | Cat# E7645L |
| Deposited data | ||
| Whole genome sequence data | Sequence Read Archive project number PRJNA967729 | Accession numbers listed in Supplementary Table 1A–D |
| RNA-seq data | Gene Expression Omnibus (GEO) project number GSE232437 | Accession numbers listed in Supplementary Table 1E |
| Proteomics data | PRIDE: Proteome Xchange | Accession number: PXD045163 |
| Experimental models: Organisms/strains | ||
| Mice: C57Bl/6 | Jackson Laboratory | Cat#000664 RRID:IMSR_JAX:000664 |
| Software and algorithms | ||
| GraphPad Prism 9 | GraphPad Software, Inc. | |
| NextGenMap | Sedlazeck et al. (64) | https://cibiv.github.io/NextGenMap/ |
| SAMtools | Li et al. (65) | http://www.htslib.org/ |
| Lumpy | Layer et al. (66) | https://github.com/arq(5)x/lumpy-sv |
| Yaha | Faust et al. (67) | https://github.com/GregoryFaust/yaha |
| MergeSamFiles from Picard tools version 2.10.0 | Broad Institute | https://github.com/broadinstitute/picard |
| FreeBayes version 1.1.0 | Garrison et al. (68) | https://github.com/freebayes/freebayes |
| SNPEff version 4.3.1 | Cingolani et al. (69) | https://pcingola.github.io/SnpEff/ |
| CNVnator 0.3.2 | Abyzov et al. (70) | https://github.com/abyzovlab/CNVnator |
| QTLseqr | Mansfield et al. (71) | https://github.com/bmansfeld/QTLseqr |
| FastQC | Wingett et al. (75) | https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ |
| Hisat2 version 2.2.1 | Kim et al. (76) | http://daehwankimlab.github.io/hisat2/ |
| Subread 2.0.0 | Liao et al. (77) | https://rnnh.github.io/bioinfo-notebook/docs/featureCounts.html |
| DESeq2 | Love et al. (78) | https://bioconductor.org/packages/release/bioc/html/DESeq2.html |
| Bulk Segregant Analysis (BSA) and Association Analysis (AA) | This paper | https://github.com/DanielPAagustinho/BSA_crypto_analysis |
| IHW (independent hypothesis weighting) package | Ignatiadis et al. (79) | https://github.com/nignatiadis/IHW |
| MaxQuant software (version 2.0.3) | Cox et al. (82) | https://maxquant.net |
All original code has been deposited at Github and is publicly available as of the date of publication. DOI information is listed in the key resources table.
Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.
