Abstract
The DNA mismatch repair (MMR) pathway corrects mismatched bases produced during DNA replication and is highly conserved across the tree of life, reflecting its fundamental importance for genome integrity. Loss of function in one or a few MMR genes can lead to increased mutation rates and microsatellite instability, as seen in some human cancers. Although loss of MMR genes has been documented in the context of human disease and in hypermutant strains of pathogens, examples of entire species and species lineages that have experienced substantial MMR gene loss are lacking. We examined the genomes of 1,107 species in the fungal phylum Ascomycota for the presence of 52 genes known to be involved in the MMR pathway of fungi. We found that the median ascomycete genome contained 49/52 MMR genes. In contrast, four closely related species of obligate plant parasites from the powdery mildew genera Erysiphe and Blumeria, have lost between five and 21 MMR genes, including MLH3, EXO1, and DPB11. The lost genes span MMR functions, include genes that are conserved in all other ascomycetes, and loss of function of any of these genes alone has been previously linked to increased mutation rate. Consistent with the hypothesis that loss of these genes impairs MMR pathway function, we found that powdery mildew genomes with higher levels of MMR gene loss exhibit increased numbers of mononucleotide runs, longer microsatellites, accelerated sequence evolution, elevated mutational bias in the A|T direction, and decreased GC content. These results identify a striking example of macroevolutionary loss of multiple MMR pathway genes in a eukaryotic lineage, even though the mutational outcomes of these losses appear to resemble those associated with detrimental MMR dysfunction in other organisms.
Keywords: mutation rate, Ascomycota, microsatellite, phylogenetics, powdery mildew, DNA repair
Significance
The DNA mismatch repair pathway corrects nucleotide base errors that occur during the replication of DNA; loss of these genes leads to cancer. We examined the conservation of the DNA mismatch repair pathway’s genes across more than 1,000 species in a fungal phylum and found a lineage of powdery mildews, a group of fungi that infect the leaves of plants, which have experienced extensive loss of multiple, otherwise highly conserved, genes. The genomes of these powdery mildews show elevated rates of diverse types of mutation, raising the hypothesis that these organisms have diversified while lacking genes thought to be essential for the accurate replication of DNA.
Introduction
An ensemble of DNA repair pathways and cell-cycle checkpoints is responsible for detecting and repairing DNA damage, ensuring faithful maintenance of the genome (Friedberg et al. 2005; Giglia-Mari et al. 2011). Among DNA repair pathways, the DNA mismatch repair (MMR) pathway is one of the best characterized (Marti et al. 2002). The MMR pathway is responsible for repairing bases that were incorrectly paired during DNA replication via five steps: recognition, incision, removal, resynthesis, and ligation (fig. 1A) (Marti et al. 2002; Fukui 2010; Hsieh and Zhang 2017). The MMR pathway is highly conserved in both bacteria and eukaryotes; cells that experience reduction or loss of function in this pathway have increased levels of mutation, as seen in cancer and drug-resistant fungal pathogen strains (Fukui 2010; Billmyre et al. 2017, 2020; Campbell et al. 2017; Dos Reis et al. 2019).
Fig. 1.
The DNA MMR pathway corrects mismatched bases produced during DNA replication and prevents instability in microsatellites. (A) The pathway comprised five conserved steps: recognition of mispaired bases by a sliding clamp, incision of the DNA strand by an endonuclease, excision of the incorrectly paired bases, resynthesis of the DNA strand, and ligation of the newly synthesized segment to the DNA strand. Supplementary table S2 includes a full list of MMR genes and their categorization into one of the five steps. (B) The MMR pathway also recognizes base or repeat addition and skipping errors during replication and corrects them. However, dysfunction in this pathway can leave replication slippage unrepaired, leading to the and the addition or deletion of base pairs or repeats, especially in highly repetitive regions such as microsatellites.
Although DNA repair genes are generally highly conserved, certain fungal lineages have been reported to have a more limited repertoire, particularly within the phylum Ascomycota. For example, budding yeasts (subphylum Saccharomycotina) and fission yeasts (Taphrinomycotina) have fewer DNA repair genes than filamentous fungi (Pezizomycotina) (Milo et al. 2019; Shen et al. 2020). Furthermore, DNA repair genes that were lost from budding yeasts and fission yeasts are more likely to also be lost in filamentous fungi (Milo et al. 2019). One lineage that has experienced extensive losses in its repertoire of DNA repair genes is the Hanseniaspora genus of budding yeasts (Steenwyk et al. 2019). Hanseniaspora species have undergone punctuated sequence evolution and have accumulated large numbers of diverse types of substitutions, including those associated with specific gene losses such as UV damage, suggesting that DNA repair is impaired by the high levels of DNA repair gene loss. These findings suggest that DNA repair genes are not universally conserved across fungi and that their loss is compatible with long-term evolutionary survival and diversification of fungal lineages.
One well-established consequence of MMR dysfunction is mutation in microsatellite regions of the genome. Microsatellites are repetitive tracts of DNA, with motifs 1–6 bp long repeated at least five times (Beier et al. 2017). Microsatellites are typically highly polymorphic between individuals and are commonly used as markers in population biology, forensics, paternity testing, and tumor characterization (Richman 2015). Due to their repetitive nature, microsatellites are prone to experiencing polymerase slippage, which is usually corrected by the MMR pathway (fig. 1B) (Ellegren 2004; Richman 2015). If the MMR pathway does not recognize these errors, as is the case in cancer, microsatellite instability (MSI) can occur (Campbell et al. 2017). MSI is defined by a hypermutable phenotype resulting from a loss of function in the MMR pathway (Boland and Goel 2010). Instability in microsatellites trends toward elongation in these regions, but contraction can also occur (Ellegren 2004).
Beyond increased mutation in microsatellite regions, aberrant function of the MMR pathway is associated with genome-wide signatures of genetic instabilities (Boland and Goel 2010; Billmyre et al. 2017, 2020). MMR mutations have been implicated in the development of hypermutant and ultrahypermutant human cancers, which constitute approximately 15% of human tumors and less than 1% of tumors, respectively (Campbell et al. 2017). Interestingly, very few tumors with low mutation rates contained mutations in the MMR pathway, whereas more than a third of hypermutant tumors and virtually all the ultrahypermutants contained mutations in MMR genes (Campbell et al. 2017). Hypermutant tumors had high levels of MSI suggesting their hypermutant phenotype is due, at least in part, to MMR dysfunction (Campbell et al. 2017). Hypermutation has also been observed in fungal pathogen strains that have lost MMR pathway genes, potentially driving within-host adaptation and the evolution of drug resistance. For example, Rhodes et al. (2017) found that hypermutation caused by mutations in three MMR pathway genes, including MSH2, resulted in a rapid increase in the mutation rate of the human pathogenic fungus Cryptococcus neoformans, contributing to infection relapse. Similarly, Billmyre et al. (2017) sequenced multiple strains of the human pathogenic fungus Cryptococcus deuterogattii (phylum Basidiomycota) and found that a group of strains with mutations in the MSH2 gene experienced higher rates of mutation when compared with closely related strains harboring an intact MSH2 gene. Hypermutation in Cryptococcusdeuterogattii mediated rapid evolution of antifungal drug resistance (Billmyre et al. 2017, 2020).
In contrast to MMR gene loss in the microevolutionary context of genetic or infectious disease, the extent of MMR gene loss across lineages spanning multiple species remains understudied. To determine the macroevolutionary impact of MMR gene conservation and loss, we characterized patterns of MMR gene presence and absence in the fungal phylum Ascomycota (fig. 2). We found that the MMR pathway was highly conserved across Ascomycota, with the median species having 49/52 MMR genes present. However, we found that Blumeria graminis and species in the powdery mildew genus Erysiphe (subphylum Pezizomycotina, class Leotiomycetes), a group of obligate plant parasites, had many fewer MMR genes and a faster rate of sequence evolution than their relatives and most other fungal taxa. Specifically, Erysiphe necator has lost nine MMR genes, Erysiphe pisi has lost 21 MMR genes, Erysiphe pulchra has lost seven MMR genes, and B.graminis has lost five MMR genes (fig. 3). In contrast, species closely related to Erysiphe and Blumeria have lost only 1–2 MMR genes, consistent with the high degree of MMR gene conservation in the rest of the phylum. Evolutionary genomic analyses revealed that MMR gene losses in Erysiphe and Blumeria (hereafter referred to as higher loss taxa or HLT) are associated with a proliferation of mononucleotide runs and elongation of microsatellites of all motif lengths, both of which are hallmarks of MMR pathway dysfunction. Reflecting these losses, Erysiphe and Blumeria genomes also have more pronounced mutational biases and accelerated rates of mutation. These results suggest that obligate plant parasites in the genera Blumeria and Erysiphe have diversified while lacking otherwise highly conserved MMR genes.
Fig. 2.
Conservation of MMR pathway genes across the fungal phylum Ascomycota. MMR genes are generally highly conserved across the phylum. A few model organisms and species of particular interest to medicine and agriculture are labeled as well as a representative species of the faster-evolving Hanseniaspora lineage. Gene presences are indicated in the bands surrounding the phylogeny with genes colored according to their function; red is recognition, orange is incision, yellow is excision, green is resynthesis, and purple is ligation. Branches are colored by subphylum; budding yeasts/Saccharomycotina (n = 332 species) are in red, fission yeasts/Taphrinomycotina (n = 14 species) are in purple, and filamentous fungi/Pezizomycotina (n = 761 species) are in green. Taxon names have been omitted from the phylogeny for visualization purposes; the phylogenetic tree with taxon names can be found in supplementary figure S1 and Shen et al. (2020). The inset phylogenetic tree shows the HLT (in blue) and LLT (in black), with the blue box beneath highlighting them.
Fig. 3.
The powdery mildews Erysiphe and Blumeria have lost many more MMR pathway genes than closely related species. HLT (shown in blue font) have lost 5–21 MMR genes, whereas LLT (shown in black font) have lost 1–2 genes and the median ascomycete has lost three genes. Note that the losses of EXO1, MLH2, MLH3, MSH5, PMS1, PMS2, and RFC1 are uniquely observed in HLT. Genes are colored according to their function; red is recognition, orange is incision, yellow is excision, green is resynthesis, and purple is ligation.
Results
MMR Genes Are Highly Conserved across the Fungal Phylum Ascomycota
By examining the presence of 52 MMR genes using a combination of sequence similarity search algorithms across the genomes of 1,107 fungal species, we found that the MMR pathway is highly conserved across Ascomycota (a median of 49/52 MMR genes per species; fig. 2 and supplementary file S1). Sixteen genes were present in all species; these included five recognition genes (MSH1, MSH2, MSH3, MSH4, and MSH6), one incision gene (MLH1), one removal gene (DIN7), five resynthesis genes (CDC6, RFC2, RFC3, RFC4, and RFC5), and all four ligation genes (ADL1, CDC17, CDC9, and LIG1). Few genes experienced extensive loss. Of the 11 most commonly lost genes, which were lost in >5% of species, two (MYH1 and UVE1) were lost in the common ancestor of Saccharomycotina, in addition to losses observed in other taxa. The remaining nine genes are unevenly distributed across functions; seven are involved in DNA resynthesis (CDC27, CDM1, POL32, POLD3, POLD4, RPA3, and SSB3), one is involved in mismatch recognition (HSM3), one is involved in incision (HNT3). These findings suggest that genes in the MMR pathway are well conserved across Ascomycota.
A comparison of our results with those reported in Milo et al. (2019) revealed similar patterns of gene presence and absence. For example, Milo et al. (2019) found that MYH1 was absent from much of Pezizomycotina, which is consistent with our results. However, we did identify a few differences (inferred losses by Milo et al. [2019] vs. inferred presence in our analyses), which suggest that our pipeline is more conservative in classifying gene losses. We surmise that these differences stem from differences in the gene detection pipelines employed and the divergent objectives of the two studies; Milo et al. (2019) aimed to identify orthologs via a reciprocal best BLAST hit approach, whereas we aimed to identify homologs using pHMMs with absences verified using TBlastN. Importantly, analysis of the human proteome using our pipeline detected all known human orthologs except for HSM3, POL32, RPA3, and SSB3, suggesting that our pipeline is well suited to detect MMR genes within Ascomycota, but that a few MMR genes may be more rapidly evolving and therefore more difficult to detect.
Extensive Loss of MMR Genes in a Lineage of Powdery Mildews
Although MMR genes are highly conserved across Ascomycota, we found that a lineage of obligate plant parasite powdery mildews has among the fewest MMR genes of the 1,107 Ascomycota species examined. We further verified gene losses in HLT and in closely related taxa that experienced fewer losses (hereafter referred to as lower loss taxa or LLT) by performing TBlastN using the amino acid sequences found by the pHMM for each MMR gene of Cairneyella variabilis, an LLT with a highly conserved MMR pathway, as queries and with thresholds described by Milo et al. (2019). This resulted in nine MMR genes losses in E.necator, 21 in E.pisi, and seven in E.pulchra (fig. 3). Erysiphenecator has been previously documented to have a high rates of genome evolution (Milo et al. 2019) and genomic instability (Jones et al. 2014). Blumeria graminis, which is sister to the Erysiphe genus, has lost five MMR genes; previous studies reported extensive gene loss in diverse pathways in this species, generally in genes thought to be unnecessary for its biotrophic lifestyle (Spanu et al. 2010). In contrast, the closely related species C. variabilis and Phialocephala scopiformis only lack MYH1, an adenine DNA glycosylase that is lost in most filamentous fungi (Chang et al. 2001). In addition to MYH1, closely related species Articulospora tetracladia lacks HSM3, which has been lost in many Pezizomycotina genomes. Ascocoryne sarcoides lacks MYH1 and POL32, a DNA polymerase δ subunit, which is part of a larger complex that participates in multiple DNA repair pathways, including nucleotide excision repair and base excision repair (Gerik et al. 1998). Much like the rest of the phylum, genes associated with resynthesis are lost more frequently, but Erysiphe and Blumeria have lost genes associated with all MMR functions (supplementary table S2) except ligation. In addition, seven of the observed MMR gene losses occur nowhere else in Ascomycota: EXO1 (excision), MLH2 (incision), MLH3 (incision), MSH5 (recognition), PMS1 (incision), PMS2 (incision), and RFC1 (resynthesis). Taken together, these results raise the hypothesis that HLT may have a partially functional MMR pathway.
Although select Erysiphe taxa have lost more genes than any other species, there are other species with moderate to high levels of MMR gene loss across the phylum. A total of 318 species have lost five or more genes across Ascomycota: five species in subphylum Taphrinomycotina, 239 in Saccharomycotina, and 74 in Pezizomycotina. The disproportionate number of Saccharomycotina and Taphrinomycotina species is consistent with our knowledge that organisms in these lineages have, on average, a smaller number of DNA repair genes compared with Pezizomycotina (Milo et al. 2019). MMR gene loss in certain Saccharomycotina lineages, such as in some species from the genera Hanseniaspora (Steenwyk, Opulente, et al. 2019), Tetrapisispora, and Dipodascus, is comparable to the loss observed in HLT. However, only nine other species in Pezizomycotina showed MMR gene loss to the same degree as any Erysiphe species. In general, species with elevated levels of gene loss primarily lost genes noted as commonly lost earlier in this paper (see MMR Genes are Highly Conserved across the Fungal Phylum Ascomycota), with occasional losses in other genes.
There was a notable discrepancy between the presence and absence of MMR genes inferred by pHMM versus TBlastN in HLT that was not observed in other species. When measured by pHMM, E. pulchra lost 42 MMR genes, as opposed to nine when using TBlastN with model organism sequences as queries and seven when using TBlastN with C. variabilis sequences as queries to verify the absences. Erysiphenecator and E. pisi lost 51 MMR genes according to our pHMMs, as opposed to ten and 22 by model organism TBlastN and nine and 21 with C. variabilis TBlastN, respectively. Blumeriagraminis lost nine MMR genes by pHMM, six by model organism TBlastN, and five by C. variabilis TBlastN. In the closely related LLT C. variabilis, P. scopiformis, A. tetracladia, and Asc. sarcoides, genes deemed absent by pHMMs were also deemed absent in our model organism and C. variabilis TBlastN searches; the sole exception was HSM3, which was identified as present in Asc. sarcoides using C. variabilis TBlastN. Even though pHMMs are more sensitive in sequence similarity searches and typically outperform TBlastN when detecting genes on an evolutionary timescale (Yoon 2009), this discrepancy is likely explained by the lower annotation quality of some HLT species and the lower genome quality for E. pisi (supplementary table S1).
Higher MMR Gene Loss Taxa Show Increased Number and Length of Microsatellites
Examination of microsatellites revealed microsatellite expansions in HLT in comparison to LLT. Specifically, we found statistically significant increases in the number and length of microsatellites in HLT compared with LLT (fig. 4A, table 1, and supplementary tables S3 and S4). Overall, after controlling for genome size, HLT had significantly more microsatellites than LLT (F = 34.83; P < 0.001; ANOVA; supplementary table S3). This effect was driven by the very large increase in the number of mononucleotide runs in Erysiphe and Blumeria (fig. 4C) (P < 0.001; Tukey honest significant difference [HSD]; supplementary table S3). There was no statistically significant difference between the groups in the numbers of microsatellites with a 2–6 bp motif length (supplementary table S3). HLT showed significantly higher average microsatellite lengths at every motif size than LLT (fig. 4B) (P < 0.01 for 1 bp, P < 0.001 for all other motif lengths; Wilcoxon rank sum test; supplementary table S4). HLT have an increased number of mononucleotide runs (after controlling for genome size) and an increase in length of microsatellites of all motif lengths, suggesting that the MMR pathway’s function is compromised in these species.
Fig. 4.
Genomes of HLT (blue bars) show a proliferation of mononucleotide runs and an increase in their microsatellite lengths compared with LLT (gray bars). (A) Examination of microsatellites in HLT and LLT (gray bars) showed a significant increase in the number of mononucleotide runs in HLT (P < 0.001; ANOVA, Tukey HSD; supplementary table S3). Asterisks in graph indicate significance; **P < 0.01; ***P < 0.001. (B) Microsatellites of each motif length are significantly longer in HLT (P < 0.01 for 1 bp, P < 0.001 for all other motif lengths; Wilcoxon rank sum test; supplementary table S4). (C) Mononucleotide runs are longer and more numerous in HLT than LLT (supplementary tables S3 and S4).
Table 1.
MMR Gene Loss Is Associated with an Increased Number of Microsatellites
Species | Group | Genome Size (Mb) | 1 bp | 2 bp | 3 bp | 4 bp | 5 bp | 6 bp |
---|---|---|---|---|---|---|---|---|
A. tetracladia | LLT | 41.8 | 306 | 725 | 1,028 | 525 | 288 | 174 |
Asc. sarcoides | LLT | 34.3 | 213 | 677 | 601 | 365 | 164 | 75 |
C. variabilis | LLT | 50.7 | 420 | 374 | 336 | 54 | 47 | 69 |
P. scopiformis | LLT | 48.9 | 325 | 475 | 840 | 301 | 159 | 71 |
B. graminis | HLT | 124.5 | 6,379 | 1,047 | 1,752 | 659 | 389 | 190 |
E. necator | HLT | 52.5 | 4,616 | 2,829 | 1,135 | 1,172 | 290 | 145 |
E. pisi | HLT | 69.3 | 4,463 | 1,841 | 1,303 | 521 | 80 | 125 |
E. pulchra | HLT | 63.5 | 2,681 | 969 | 990 | 522 | 89 | 87 |
Note.—LLT: Articulospora tetracladia, Ascocoryne sarcoides, Cairneyella variabilis, Phialocephala scopiformis. HLT: Blumeria graminis, Erysiphe necator, Erysiphe pisi, Erysiphe pulchra.
HLT Show Mutational Biases
By examining patterns of substitutions among HLT and LLT, we found that HLT displayed stronger mutational biases associated with impaired DNA repair pathway function in comparison to LLT. For example, significantly more substitutions were observed at all codon positions in HLT versus LLT (fig. 5A) (P < 0.01; Tukey HSD; supplementary table S5) and a significant bias toward substitutions in the A|T direction (fig. 5B) (P < 0.001; Tukey HSD; supplementary table S6). HLT also had a lower ratio of transitions to transversions (0.92 ± 0.04) than LLT (0.99 ± 0.02), though this is not statistically significant (fig. 5C) (P = 0.06; Wilcoxon rank sum exact test; supplementary table S7). Additionally, HLT had lower GC content (HLT: 40.10 ± 0.02% vs. LLT: 47.49 ± 0.01%). Linear regression revealed a significant decrease (F = 8.661, P = 0.026; supplementary table S8) in GC content of the genomes as the number of MMR genes lost increases (fig. 5D).
Fig. 5.
HLT show diverse types of mutational bias compared with LLT. (A) HLT (blue bars/fonts) show increased counts in base substitution at every codon position when compared with LLT (gray bars/fonts) (P < 0.01; ANOVA, Tukey HSD; supplementary table S5). (B) HLT show significant mutational bias toward mutation in the A|T direction, whereas this trend is not significant in LLT (P < 0.001; P = 0.27; ANOVA, Tukey HSD; supplementary table S6). (C) HLT show a decreased ratio of transitions to transversions when compared with LLT, although this difference is not statistically significant (P = 0.06; supplementary table S7). (D) Genome GC content decreases with increasing MMR gene loss (adjusted R2=0.5225; P = 0.026; linear regression; supplementary table S8). Asterisks in graphs indicate significance; **P < 0.01; ***P < 0.001.
HLT Have Experienced Accelerated Rates of Sequence Evolution
To test if the rate of evolution of HLT differed from that of LLT, we performed ω-based branch tests. Our null hypothesis was that all branches of the phylogeny for our selected eight species had the same rate of evolution, whereas our alternate hypothesis posited that HLT branches, including the branch of their most recent common ancestor, experienced a different rate of sequence evolution than LLT branches. We found that 60.75% of genes rejected the null hypothesis (α = 0.05; n = 500) and 39.25% failed to reject the null (n = 323) (fig. 6A and supplementary file S2). Of the genes which rejected the null hypothesis, 79.80% (n = 399) experienced higher rates of substitution in HLT, which constitutes 48.48% of all genes tested (fig. 6A). Among the genes that rejected the null hypothesis, the difference between the ω values for the HLT (median ω = 0.0899) and the LLT (median ω = 0.0567) showed accelerated rates of substitution for HLT branches (fig. 6B). These results suggest that MMR gene loss is associated with a genome-wide signature of accelerated mutation rates.
Fig. 6.
Powdery mildew HLT show accelerated rates of evolution. (A) Most (60.75%; n = 500) BUSCO genes reject the null hypothesis that HLT branches experience the same rate of substitution as LLT branches. About 48.48% of BUSCO genes support a higher rate of evolution for HLT (n = 399; in blue), 12.27% support a higher rate of evolution for LLT (n = 101; in gray), and 39.25% (n = 323) fail to reject the null hypothesis that the rate of substitution is uniform across HLT and LLT branches (in white). Among genes that support the alternative hypothesis, 79.80% (n = 399) support that Erysiphe and Blumeria evolve more quickly than LLT. (B) Among genes which reject the null hypothesis, the distribution of differences between ω values for HLT and LLT branches show elevated substitution rates in HLT.
Discussion
Using sequence similarity searches, we examined the conservation of the MMR pathway across 1,107 ascomycete species. The near universal conservation of the vast majority of MMR genes across the phylum confirms this pathway’s known critical role for DNA maintenance (Schofield and Hsieh 2003; Kunkel and Erie 2005; Fukui 2010). However, we also discovered that a lineage of Erysiphe and Blumeria powdery mildews, named HLT, have experienced significant MMR gene loss (fig. 3). HLT exhibit increases in mononucleotide run count, microsatellite length, mutational biases, and rate of evolution (figs. 4–6), suggesting that the function of their MMR pathway may be impaired. Although DNA repair mechanisms are present in all eukaryotes and are highly conserved, there is mounting evidence of exceptions to this rule in the fungal kingdom (Steenwyk, Opulente, et al. 2019; Steenwyk 2021). The increased MMR gene absence observed in HLT correlates with changes in the microsatellite compositions of their genomes. The significant difference between HLT and LLT in the number of mononucleotide runs is consistent with mutational patterns present in human cancers and MMR deficient yeast cells (Arzimanoglou et al. 1998; Lang et al. 2013). Mononucleotide runs are the most prone to replication fork slippage and are used to diagnose MSI in tumors (Richman 2015). In addition to an increase in the number of mononucleotide runs (fig. 4A), HLT showed significantly longer microsatellites for each motif length than LLT (fig. 4B), which suggests impaired MMR function and increased replication fork slippage.
Examination of HLT genomes revealed mutational signatures suggesting that the MMR pathway has been impaired by the observed gene losses. Patterns of substitutions suggest the loss of MMR genes leads to increased substitution rates (fig. 5A) and lower GC content (fig. 5D). More specifically, the prominent A|T bias of substitutions in the HLT is likely driven by the known A|T bias of mutations previously observed in bacteria and eukaryotes, including Saccharomycescerevisiae (Keightley et al. 2009; Hershberg and Petrov 2010; Lynch 2010; Zhu et al. 2014). Furthermore, GC content decreases in proportion to the number of MMR genes lost in the HLT and LLT, which was also observed among Hanseniaspora, a lineage of budding yeasts that have lost diverse DNA repair genes (Steenwyk, Opulente, et al. 2019). There is no significant difference between the transition to transversion (Ts/Tv) ratios of the HLT (0.92 ± 0.04) and LLT (0.99 ± 0.02), though the trend follows what we would expect for HLT having less efficient DNA repair. Both lineages exhibit near-neutral Ts/Tv ratios (Lynch et al. 2008; Zhu et al. 2014). Examination of ω values suggests that faster rates of sequence evolution in HLT compared with the LLT may be associated with MMR gene loss. Long branches, which reflect more substitutions per site, have been previously reported elsewhere for E. necator (Milo et al. 2019), providing independent support to our findings.
Species in the LLT lineage show a diversity of ecologies. For example, Asc. sarcoides is saprobic fungus which grows on trees (Gianoulis et al. 2012), A. tetracladia is a globally distributed aquatic hyphomycete (Seena et al. 2012), C. variabilis is an ericoid mycorrhizal fungus (Midgley et al. 2016), and P. scopiformis is a foliar endophyte (Walker et al. 2016). In contrast, species in the genera Erysiphe and Blumeria are all powdery mildews and obligate plant parasites. Blumeriagraminis is the only species in the genus Blumeria (Inuma et al. 2007), whereas there are ∼450 known species in the genus Erysiphe (Takamatsu et al. 2015). The Erysiphe species we sampled are distributed across the phylogeny of the genus (Takamatsu et al. 2015; Ellingham et al. 2019); phylogenetic analyses by Takamatsu et al. (2015) placed E. pisi and E. pulchra in separate phylogenetic groups that diverged 15–20 Ma, and analyses by Ellingham et al. (2019) showed that E. pisi and E. necator are distantly related (Ellingham et al. 2019). Considering our taxon sampling and obligate plant parasitic lifestyle of Erysiphe species, we hypothesize that our findings likely apply to all species in the genus.
In our approach to investigate the conservation of the MMR pathway in Ascomycota, we chose to be inclusive when selecting genes that function as part of the pathway and conservative when ruling genes absent. Some genes were computationally annotated as belonging to the MMR pathway based on sequence homology and others are implicated in multiple pathways. That said, of the 24 MMR genes lost in at least one HLT species, 16 are included in the KEGG MMR pathway entries for our three model organisms, suggesting that many of the observed losses concern genes directly involved in MMR. In addition, there may be other contributing factors to the observed mutational differences between HLT and LLT, such as dysfunction in other DNA repair pathways, loss of methyl transferases, and differences in chromatin structure (Steenwyk, Opulente, et al. 2019; Möller et al. 2021). For example, previous studies have identified the loss of genes involved in the repeat-induced point mutation pathway in powdery mildews (Spanu et al. 2010; van Wyk et al. 2020). Loss of genes in the repeat-induced point pathway in HLT could contribute to the elevated mutation rates we observed relative to LLT. The MMR pathway is required for maintaining heterochromatin stability in S. cerevisiae; dysfunction in this pathway could potentially lead to genome instability within the HLT (Dahal et al. 2017). MMR is more error prone in heterochromatin than euchromatin, likely due at least in part to mechanical accessibility of the MMR machinery (Sun et al. 2016; Dahal et al. 2017). Although base–base mismatches are repaired less efficiently in heterochromatin than in euchromatin, single-nucleotide insertions and deletions are repaired with similar efficiency in euchromatin and heterochromatin (Dahal et al. 2017); therefore, differences in chromatin structure could have contributed to some of the observed mutational differences but not to the observed differences in mononucleotide runs and microsatellite repeats.
Loss of function in the MMR pathway may be advantageous in certain environments or under certain lifestyles. For example, strains of human pathogens with impaired MMR function are found in environments where antifungal drugs are present. Some of these strains have evolved drug resistance, so the elevated mutation rate generated by MMR gene loss may be adaptive under certain stressful situations (Billmyre et al. 2017, 2020; Rhodes et al. 2017). Species with higher levels of MMR gene loss may be associated with a parasitic lifestyle, though not all parasites have high levels of MMR gene loss; dysfunction in this pathway may be more adaptive, or at least less detrimental, to these organisms, as seen in the HLT. Loss of DNA repair pathways is also observed in other parasites and may contribute to elevated mutation rates (Gill and Fast 2007; Derilus et al. 2021). Organisms with parasitic lifestyles tend to evolve more rapidly than free-living organisms; whereas these mechanisms are unknown, previous work has identified that the loss of the classical nonhomologous end joining (C-NHEJ) pathway is common in this lifestyle and may even be a contributing factor (Nenarokova et al. 2019). Previous studies of genome structure in E. necator have found genome expansion largely driven by transposable elements and suggest that genome instability, particularly in copy number variants, can mediate rapid evolution of fungicidal resistance (Jones et al. 2014). The evolution of fungicide resistance in powdery mildews has implications for agriculture; major crops are impacted by these pathogens and some are able to quickly evolve resistance to antifungal chemicals, with resistance evolution accelerated by increased use (Jones et al. 2014; Vielba-Fernández et al. 2020). More broadly, genome instability among HLT taxa reflects their parasitic lifestyle, which is associated with gene loss and plastic genomic architecture (Schmidt and Panstruga 2011). Gene loss in primary and secondary metabolism, enzymes acting on carbohydrates, and transporters has been documented in B. graminis, as well as massive expansion in retrotransposons and genome size, reflecting extreme genomic changes associated with its parasitic lifestyle (Spanu et al. 2010). The lost genes are involved in diverse pathways, including anaerobic fermentation, biosynthesis of glycerol from glycolytic intermediates, and nitrate assimilation, and include multiple subfamilies of transporters (Spanu et al. 2010). Given their extreme genomic changes and importance to agriculture, Blumeria and Erysiphe may be novel models to study the outcome and evolutionary trajectory of sustained loss of MMR pathways.
Materials and Methods
Curation of the Set of DNA MMR Pathway Genes
To investigate the presence and absence of MMR genes across the fungal phylum Ascomycota, we curated a data set of MMR genes from the genomes of three fungal model organisms representing the three different subphyla: S.cerevisiae (subphylum Saccharomycotina), Neurospora crassa (Pezizomycotina), and Schizosaccharomyces pombe (Taphrinomycotina). We used three sources to curate genes that are part of the MMR pathway: the Kyoto Encyclopedia of Genes and Genomes (KEGG; Kanehisa and Goto 2000), the Schizosaccharomyces pombe database (PomBase; Lock et al. 2019; The Gene Ontology Consortium 2019), and the Saccharomyces Genome Database (SGD; Cherry et al. 2012). Aiming to be inclusive when selecting genes to be included as MMR, genes in the KEGG diagram of the MMR pathway for each species were included and the gene ontology term “mismatch repair” was used to search for the genes on SGD and Pombase (Ashburner et al. 2000). We used both computationally and manually curated genes from SGD. We began curating our set of MMR genes in S. cerevisiae, with a total of 30 MMR genes identified with KEGG and SGD. Next, we searched KEGG and Pombase for genes in Schizosaccharomycespombe that had not been annotated as part of the MMR pathway in S. cerevisiae (n = 15). We concluded by searching for N. crassa MMR genes in KEGG which had not already been categorized as MMR genes in the other two species (n = 7). KEGG listed two sequences for the N. crassa gene LIG1; however, since our sequence similarity search analyses with both sequences yielded identical patterns of loss, we present them as one gene. This approach yielded a total of 52 genes associated with MMR (supplementary table S2).
MMR Gene Conservation Analysis
To examine the conservation of MMR genes across Ascomycota, we implemented a sensitive probabilistic modeling approach using profile Hidden Markov Models (pHMMs) (Johnson et al. 2010) of MMR genes across the genomes of 1,107 species (Shen et al. 2020). To construct pHMMs, we first searched for putative homologs of MMR genes in the fungal RefSeq protein database using the BlastP function of BLAST+, v2.8.1, with a bitscore threshold of 50 and an e-value cutoff of 1×10−3 (Pearson 2013). We retrieved the top 100 hits using SAMtools, v1.6 (Li et al. 2009) with the “faidx” function. We used MAFFT, v7.402 (Katoh et al. 2002), with the “genafpair” and “reordered” parameters, a maximum of 1,000 cycles of iterative refinement, the BLOSUM62 matrix, a gap opening penalty of 1.0, and the retree parameter set to 1, to align the sequences following previously established protocol (Steenwyk, Shen, et al. 2019). We then used the aligned amino acid sequences as input to the “hmmbuild” function in HMMER-3.1B2 to construct each pHMM. We ran the pHMMs of the 52 proteins against all 1,107 proteomes using the “hmmsearch” function. For a gene to be considered present, we set a bitscore threshold of at least 50 and an e-value threshold of less than 1×10−10. We used the TBlastN function of BLAST+, v2.8.1 with a bitscore threshold of 50, e-value cutoff of 1×10−3, and 50% minimum query coverage to verify MMR gene absence using the protein sequence of the gene in question and the 1,107 Ascomycota genomes. We used the Interactive Tree of Life (iTOL), v4 (Letunic and Bork 2019) to visualize the conservation of MMR genes on the Ascomycota phylogeny and to map losses on it. To further verify gene absences in HLT and LLT with C. variabilis amino acid sequences, we used the TBlastN function of BLAST+, v2.8.1 with an e-value threshold of less than 1×10−5, a word size of five or more, and minimum query coverage of 80% (Milo et al. 2019).
Microsatellite Identification and Characterization
To identify microsatellites and evaluate their numbers and lengths between genomes with substantial MMR gene loss against those with higher levels of MMR gene conservation, we used the Microsatellite Identification tool (MISA), v2.0 (Beier et al. 2017). Specifically, we compared the microsatellites of two groups of taxa, each of which contained four species. The group of HLT contains the powdery mildews B.graminis, E.necator, E.pisi, and E.pulchra, which show high levels of MMR gene loss relative to other ascomycetes. The group of LLT contains four closely related species with low levels of MMR gene loss, similar to patterns seen across the rest of the phylum: A.tetracladia, Asc.sarcoides, C.variabilis, and P.scopiformis. The length minimums used for MISA to identify a microsatellite are as follows: 1-bp motifs must repeat 12 times, 2-bp motifs must repeat six times, 3- to 6-bp motifs must repeat five times. All values used are MISA defaults, except the mononucleotide parameter, which was increased from the default value of ten repeats to 12 (Temnykh 2001; Beier et al. 2017). A two-way ANOVA test was performed to test for significance in the number of microsatellites controlled by genome size of each motif length between HLT and LLT. If the two-way ANOVA rejected the null hypothesis (α = 0.05), pairwise comparisons were made with the Tukey HSD test. We performed the statistical analysis using R, v3.4.1, and made the figures using ggplot2, v3.1.0 (Wickerham 2016), and ggpubfigs, v1.0.0 (Steenwyk 2020).
Estimation of Mutational Bias and Rate of Sequence Evolution
To characterize the mutational spectra and estimate the rate of sequence evolution between HLT and LLT, we first identified and aligned orthologous sequences across all eight genomes. Orthologous single-copy protein sequences from genes present in all eight genomes (n = 823) were identified using the BUSCO, v4.0.4 (Waterhouse et al. 2018) pipeline and the OrthoDB, v10, Ascomycota database (Creation date: November 20, 2019) (Kriventseva et al. 2019). We hereafter refer to the 823 single-copy genes as BUSCO genes. BUSCO genes were aligned using MAFFT, v7.402 (Katoh et al. 2002), using the same settings described above. Codon-based alignments were generated by threading the corresponding nucleotide sequences onto the protein alignment using “thread_dna” function in PhyKIT, v0.1 (Steenwyk et al. 2021).
To examine patterns of substitutions, we used codon-based alignments to identify nucleotides that differed in a given taxon of interest compared with C. variabilis, which was the sister taxon to a clade comprised the other seven genomes of interest in the Ascomycota phylogeny. More specifically, we compared the character states for a species of interest to C. variabilis for each site of each alignment, tracking codon position information (i.e., first, second, or third codon position). We also determined if the substitution was a transition or transversion and examined substitution directionality (e.g., A|T to G|C or G|C to A|T) using C. variabilis as the outgroup. These analyses were completed using custom python scripts that utilize functions in Biopython, v1.70 (Cock et al. 2009).
Finally, we used the codon alignments to compare the rate of sequence evolution between HLT and LLT. Specifically, we measured the ratio of the rate of nonsynonymous substitutions to the rate of synonymous substitutions (dN/dS or ω) along the species phylogeny for each gene using the CODEML function in PAML, v4.9 (Yang 2007). For each test, the null hypothesis (Ho) was that all branches had the same ω value (model=0); the alternative hypothesis (HA) was that all HLT branches, including the branch of their most recent common ancestor, had one ω value and all other branches had a distinct ω value (model=2). To determine if the alternative hypothesis was a better fit than the null hypothesis (α = 0.05), we used a likelihood ratio test.
Acknowledgments
We thank Qianhui Zheng for performing exploratory analysis on the HLT and LLT genomes. M.A.P. was partially supported by the Vanderbilt Undergraduate Summer Research Program and the Goldberg Family Immersion Fund. J.L.S. and A.R. were supported by the Howard Hughes Medical Institute through the James H. Gilliam Fellowships for Advanced Study Program. A.R.’s laboratory received additional support from the Burroughs Wellcome Fund, the National Science Foundation (DEB-1442113 and DEB-2110404), and the National Institutes of Health/National Institute of Allergy and Infectious Diseases (R56AI146096). X.-X.S. was supported by the National Natural Science Foundation of China (No. 32071665) and the Young Scholar 1000 Talents Plan. A.R. is a scientific consultant for LifeMine Therapeutics, Inc.
Data Availability
Supporting statistical analysis, the Ascomycota phylogeny with species names, and two supplementary data files (MMR gene presence/absence matrix and ω output) are available via figshare at https://doi.org/10.6084/m9.figshare.14410994. The data supporting the phylogeny of Ascomycota are available at https://doi.org/10.6084/m9.figshare.12751736.
Literature Cited
- Arzimanoglou II, Gilbert F, Barber HRK.. 1998. Microsatellite instability in human solid tumors. Cancer 82(10):1808–1820. [DOI] [PubMed] [Google Scholar]
- Ashburner M, et al. 2000. Gene ontology: tool for the unification of biology. Nat Genet. 25(1):25–29. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beier S, Thiel T, Münch T, Scholz U, Mascher M.. 2017. MISA-web: a web server for microsatellite prediction. Bioinformatics 33(16):2583–2585. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Billmyre RB, Applen Clancey S, Li LX, Doering TL, Heitman J.. 2020. 5-Fluorocytosine resistance is associated with hypermutation and alterations in capsule biosynthesis in Cryptococcus. Nat Commun. 11(1):127. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Billmyre RB, Clancey SA, Heitman J.. 2017. Natural mismatch repair mutations mediate phenotypic diversity and drug resistance in Cryptococcus deuterogattii. Elife 6:e28802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Boland CR, Goel A.. 2010. Microsatellite instability in colorectal cancer. Gastroenterology 138(6):2073–2087.e3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Campbell BB, et al. 2017. Comprehensive analysis of hypermutation in human cancer. Cell 171(5):1042–1056.e10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chang D-Y, Gu Y, Lu A-L.. 2001. Fission yeast (Schizosaccharomyces pombe) cells defective in the MutY-homologous glycosylase activity have a mutator phenotype and are sensitive to hydrogen peroxide. Mol Genet Genomics. 266(2):336–342. [DOI] [PubMed] [Google Scholar]
- Cherry JM, et al. 2012. Saccharomyces Genome Database: the genomics resource of budding yeast. Nucleic Acids Res. 40(Database issue):D700–D705. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cock PJA, et al. 2009. Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics 25(11):1422–1423. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dahal BK, et al. 2017. Involvement of DNA mismatch repair in the maintenance of heterochromatic DNA stability in Saccharomyces cerevisiae. PLoS Genet. 13(10):e1007074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Derilus D, Rahman MZ, Serrano AE, Massey SE.. 2021. Proteome size reduction in Apicomplexans is linked with loss of DNA repair and host redundant pathways. Infect Genet Evol. 87:104642. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dos Reis TF, et al. 2019. The Aspergillus fumigatus mismatch repair MSH2 homolog is important for virulence and azole resistance. mSphere 4(4):e0041619. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ellegren H. 2004. Microsatellites: simple sequences with complex evolution. Nat Rev Genet. 5(6):435–445. [DOI] [PubMed] [Google Scholar]
- Ellingham O, David J, Culham A.. 2019. Enhancing identification accuracy for powdery mildews using previously underexploited DNA loci. Mycologia 111(5):798–812. [DOI] [PubMed] [Google Scholar]
- Friedberg EC, et al. 2005. DNA repair and mutagenesis. Washington, DC: ASM Press. doi: 10.1128/9781555816704. [DOI] [Google Scholar]
- Fukui K. 2010. DNA mismatch repair in eukaryotes and bacteria. J Nucleic Acids. 2010:1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gerik KJ, Li X, Pautz A, Burgers PMJ.. 1998. Characterization of the two small subunits of Saccharomyces cerevisiae DNA polymerase. J Biol Chem. 273(31):19747–19755. [DOI] [PubMed] [Google Scholar]
- Gianoulis TA, et al. 2012. Genomic analysis of the hydrocarbon-producing, cellulolytic, endophytic fungus Ascocoryne sarcoides. PLoS Genet. 8(3):e1002558. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Giglia-Mari G, Zotter A, Vermeulen W.. 2011. DNA damage response. Cold Spring Harb Perspect Biol. 3(1):a000745. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gill EE, Fast NM.. 2007. Stripped-down DNA repair in a highly reduced parasite. BMC Mol Biol. 8:24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hershberg R, Petrov DA.. 2010. Evidence that mutation is universally biased towards AT in bacteria. PLoS Genet. 6(9):e1001115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hsieh P, Zhang Y.. 2017. The Devil is in the details for DNA mismatch repair. Proc Natl Acad Sci U S A. 114(14):3552–3554. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Inuma T, Khodaparast SA, Takamatsu S.. 2007. Multilocus phylogenetic analyses within Blumeria graminis, a powdery mildew fungus of cereals. Mol Phylogenet Evol. 44(2):741–751. [DOI] [PubMed] [Google Scholar]
- Johnson LS, Eddy SR, Portugaly E.. 2010. Hidden Markov model speed heuristic and iterative HMM search procedure. BMC Bioinformatics 11:431. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jones L, et al. 2014. Adaptive genomic structural variation in the grape powdery mildew pathogen, Erysiphe necator. BMC Genomics 15:1081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kanehisa M, Goto S.. 2000. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28(1):27–30. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katoh K, Misawa K, Kuma K, Miyata T.. 2002. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 30(14):3059–3066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Keightley PD, et al. 2009. Analysis of the genome sequences of three Drosophila melanogaster spontaneous mutation accumulation lines. Genome Res. 19(7):1195–1201. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kriventseva EV, et al. 2019. OrthoDB v10: sampling the diversity of animal, plant, fungal, protist, bacterial and viral genomes for evolutionary and functional annotations of orthologs. Nucleic Acids Res. 47(D1):D807–D811. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kunkel TA, Erie DA.. 2005. DNA mismatch repair. Annu Rev Biochem. 74:681–710. [DOI] [PubMed] [Google Scholar]
- Lang GI, Parsons L, Gammie AE.. 2013. Mutation rates, spectra, and genome-wide distribution of spontaneous mutations in mismatch repair deficient yeast. G3 (Bethesda) 3(9):1453–1465. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Letunic I, Bork P.. 2019. Interactive Tree Of Life (iTOL) v4: recent updates and new developments. Nucleic Acids Res. 47(W1):W256–W259. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H, et al. 2009. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25(16):2078–2079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lock A, et al. 2019. PomBase 2018: user-driven reimplementation of the fission yeast database provides rapid and intuitive access to diverse, interconnected information. Nucleic Acids Res. 47(D1):D821–D827. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lynch M. 2010. Rate, molecular spectrum, and consequences of human mutation. Proc Natl Acad Sci U S A. 107(3):961–968. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lynch M, et al. 2008. A genome-wide view of the spectrum of spontaneous mutations in yeast. Proc Natl Acad Sci U S A. 105(27):9272–9277. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marti TM, Kunz C, Fleck O.. 2002. DNA mismatch repair and mutation avoidance pathways. J Cell Physiol. 191(1):28–41. [DOI] [PubMed] [Google Scholar]
- Midgley DJ, et al. 2016. Genomic insights into the carbohydrate catabolism of Cairneyella variabilis gen. nov. sp. nov., the first reports from a genome of an ericoid mycorrhizal fungus from the southern hemisphere. Mycorrhiza 26(4):345–352. [DOI] [PubMed] [Google Scholar]
- Milo S, Harari-Misgav R, Hazkani-Covo E, Covo S.. 2019. Limited DNA repair gene repertoire in ascomycete yeast revealed by comparative genomics. Genome Biol Evol. 11(12):3409–3423. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Möller M, et al. 2021. Recent loss of the Dim2 DNA methyltransferase decreases mutation rate in repeats and changes evolutionary trajectory in a fungal pathogen. PLoS Genet. 17(3):e1009448. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nenarokova A, et al. 2019. Causes and effects of loss of classical nonhomologous end joining pathway in parasitic eukaryotes. MBio 10(4):e01541–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pearson WR. 2013. An introduction to sequence similarity (“homology”) searching. Curr Protoc Bioinforma. 42:3.1.1–3.1.8. [CVOCROSSCVO] [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rhodes J, et al. 2017. A population genomics approach to assessing the genetic basis of within-host microevolution underlying recurrent cryptococcal meningitis infection. G3 (Bethesda) 7(4):1165–1176. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Richman S. 2015. Deficient mismatch repair: read all about it (Review). Int J Oncol. 47(4):1189–1202. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schmidt SM, Panstruga R.. 2011. Pathogenomics of fungal plant parasites: what have we learnt about pathogenesis? Curr Opin Plant Biol. 14(4):392–399. [DOI] [PubMed] [Google Scholar]
- Schofield MJ, Hsieh P.. 2003. DNA mismatch repair: molecular mechanisms and biological function. Annu Rev Microbiol. 57:579–608. [DOI] [PubMed] [Google Scholar]
- Seena S, Duarte S, Pascoal C, Cássio F.. 2012. Intraspecific variation of the aquatic fungus Articulospora tetracladia: an ubiquitous perspective. PLoS One 7(4):e35884. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shen X-X, et al. 2020. Genome-scale phylogeny and contrasting modes of genome evolution in the fungal phylum Ascomycota. Sci Adv. 6(45):eabd0079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Spanu PD, et al. 2010. Genome expansion and gene loss in powdery mildew fungi reveal tradeoffs in extreme parasitism. Science 330(6010):1543–1546. [DOI] [PubMed] [Google Scholar]
- Steenwyk JL. 2020. JLSteenwyk/ggpubfigs: first release of ggpubfigs. doi: 10.5281/ZENODO.4126988. [DOI] [Google Scholar]
- Steenwyk JL. 2021. Evolutionary divergence in DNA damage responses among fungi. MBio 12(2):e0334820. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Steenwyk JL, et al. 2021. PhyKIT: a broadly applicable UNIX shell toolkit for processing and analyzing phylogenomic data. Bioinformatics 37(16):2325–2331. doi: 10.1093/bioinformatics/btab096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Steenwyk JL, Opulente DA, et al. 2019. Extensive loss of cell-cycle and DNA repair genes in an ancient lineage of bipolar budding yeasts. PLoS Biol. 17(5):e3000255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Steenwyk JL, Shen X-X, Lind AL, Goldman GH, Rokas A.. 2019. A robust phylogenomic time tree for biotechnologically and medically important fungi in the genera Aspergillus and Penicillium. MBio 10(4):e00925–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sun L, et al. 2016. Preferential protection of genetic fidelity within open chromatin by the mismatch repair machinery. J Biol Chem. 291(34):17692–17705. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Takamatsu S, et al. 2015. First comprehensive phylogenetic analysis of the genus Erysiphe (Erysiphales, Erysiphaceae) I. The Microsphaera lineage. Mycologia 107(3):475–489. [DOI] [PubMed] [Google Scholar]
- Temnykh S. 2001. Computational and experimental analysis of microsatellites in rice (Oryza sativa L.): frequency, length variation, transposon associations, and genetic marker potential. Genome Res. 11(8):1441–1452. [DOI] [PMC free article] [PubMed] [Google Scholar]
- The Gene Ontology Consortium. 2019. The gene ontology resource: 20 years and still GOing strong. Nucleic Acids Res. 47(D1):D330–D338. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Wyk S, Wingfield BD, De Vos L, van der Merwe NA, Steenkamp ET.. 2020. Genome-wide analyses of repeat-induced point mutations in the Ascomycota. Front Microbiol. 11:622368. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vielba-Fernández A, et al. 2020. Fungicide resistance in powdery mildew fungi. Microorganisms 8(9):1431. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Walker AK, et al. 2016. Full genome of Phialocephala scopiformis DAOMC 229536, a fungal endophyte of spruce producing the potent anti-insectan compound rugulosin. Genome Announc. 4(1):e01768–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Waterhouse RM, et al. 2018. BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol Biol Evol. 35(3):543–548. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wickerham H. 2016. ggplot2: elegant graphics for data analysis. New York: Springer-Verlag. Available from: https://ggplot2.tidyverse.org. [Google Scholar]
- Yang Z. 2007. PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 24(8):1586–1591. [DOI] [PubMed] [Google Scholar]
- Yoon B-J. 2009. Hidden Markov models and their applications in biological sequence analysis. Curr Genomics. 10(6):402–415. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhu YO, Siegal ML, Hall DW, Petrov DA.. 2014. Precise estimates of mutation rate and spectrum in yeast. Proc Natl Acad Sci U S A. 111(22):E2310–E2318. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Supporting statistical analysis, the Ascomycota phylogeny with species names, and two supplementary data files (MMR gene presence/absence matrix and ω output) are available via figshare at https://doi.org/10.6084/m9.figshare.14410994. The data supporting the phylogeny of Ascomycota are available at https://doi.org/10.6084/m9.figshare.12751736.