Abstract
Newly emerged wheat blast disease is a serious threat to global wheat production. Wheat blast is caused by a distinct, exceptionally diverse lineage of the fungus causing rice blast disease. Through sequencing a recent field isolate, we report a reference genome that includes seven core chromosomes and mini-chromosome sequences that harbor effector genes normally found on ends of core chromosomes in other strains. No mini-chromosomes were observed in an early field strain, and at least two from another isolate each contain different effector genes and core chromosome end sequences. The mini-chromosome is enriched in transposons occurring most frequently at core chromosome ends. Additionally, transposons in mini-chromosomes lack the characteristic signature for inactivation by repeat-induced point (RIP) mutation genome defenses. Our results, collectively, indicate that dispensable mini-chromosomes and core chromosomes undergo divergent evolutionary trajectories, and mini-chromosomes and core chromosome ends are coupled as a mobile, fast-evolving effector compartment in the wheat pathogen genome.
Author summary
The emerging blast disease on wheat is proving even harder to control than the ancient, still-problematic rice blast disease. Potential wheat resistance identified using strains isolated soon after disease emergence are no longer effective in controlling recent aggressive field isolates from wheat in South America and South Asia. We construct a high-quality assembly of an aggressive, recently-isolated wheat blast fungal strain and the first assembled mini-chromosome genome sequence of wheat and rice blast pathogens. We report that recent wheat pathogens can contain one or two highly-variable dispensable mini-chromosomes, each with an amalgamation of fungal effector genes and other sequences that are duplicated or absent from indispensable core chromosome ends. Well-studied effectors found on different core chromosomes in rice pathogens appear side-by-side in wheat pathogen mini-chromosomes. The rice pathogen often overcomes deployed resistance genes by deleting triggering effector genes. We propose that the fast-evolving effector-rich compartment of the wheat blast fungus is a combination of core chromosome ends and mobile mini-chromosomes that are easily lost from individual strains. Localization of effectors on mini-chromosomes would therefore accelerate pathogen adaptation in the field.
Introduction
Wheat blast is an explosive emerging disease capable of 100% yield losses. Little resistance is available in cultivated wheat varieties, and fungicides are not effective under disease favorable conditions [1,2]. The disease emerged in Brazil in 1985 and spread within South America, limiting wheat production (Fig 1). Wheat blast jumped continents in 2016, causing major yield losses in Bangladesh with this first report [3,4]. Wheat blast has now established in South Asia, enhancing fears about further disease spread, disruption of global grain trade by this seed-borne pathogen, and endangerment of global food security [5]. Wheat blast is caused by a wheat-adapted lineage of Magnaporthe oryzae (synonymous with Pyricularia oryzae) [6], known as the Triticum pathotype (MoT). MoT strains are distinct from rice pathogens in the M. oryzae Oryza pathotype (MoO) and millet pathogens in the Eleusine (MoE) and Setaria (MoS) pathotypes (S1 Fig). A serious turf grass disease emerged in the United States in the late 1980s, caused by the Lolium pathotype (MoL) with ryegrass as its major host. Although some MoL strains can infect wheat [7], MoT strains are distinguished as highly aggressive wheat pathogens that are so far restricted to certain countries in South America and South Asia (Fig 1A).
Although little is known about wheat blast, studies on rice blast disease have identified numerous effector genes, generally encoding small proteins that are specifically expressed in planta and play roles in host invasion [8–10]. Some effectors, termed avirulence (AVR) effectors, determine either rice cultivar or host species specificity through blocking infection upon recognition by corresponding cultivar- or species-specific resistance (R) genes and triggering hypersensitive resistance. For example, strains of several M. oryzae pathotypes are not able to infect weeping lovegrass, Eragrostis curvula, because they carry a host species-specific AVR effector PWL2 [11,12]. Planting of wheat varieties lacking the R gene Rwt3 in Brazil likely enabled MoL strains with the corresponding host species-specific AVR effector PWT3 to adapt to wheat, and subsequent loss of PWT3 function played a role in the wider emergence of the MoT subgroup [13]. So far, characterization of 11 MoO AVR effectors together with their corresponding R gene products has identified direct or indirect protein interactions that control rice cultivar specificity [9,14]. In contrast, understanding how individual effectors function in host invasion has been difficult due to apparent functional redundancy. That is, deletion of individual effector genes rarely dramatically impacts the pathogen's ability to cause disease.
Effector genes in diverse filamentous eukaryotic pathogens generally reside in rapidly evolving, transposon-rich chromosomal regions, which, together with slowly evolving core chromosome regions containing housekeeping genes, results in a 'two-speed' genome [15,16]. M. oryzae effectors from the Oryza pathotype are known to reside in transposon-rich regions, often near chromosome ends [9,17]. Two AVR effector genes [18,19] have been localized to dispensable mini-chromosomes (also known as supernumerary, accessory or B chromosomes [16,20,21]) that show non-mendelian inheritance and are present in some, but not all individuals in a population [22–24]. Effectors are associated with frequent presence/absence polymorphisms between and/or within the different M. oryzae lineages [18,25]. Deletion of the corresponding AVR effector gene could be a response to deploying R genes in a crop. In one well-studied case, AVR-Pita1, which corresponds to the periodically-deployed Pita rice R gene, has been mobile in the M. oryzae genome [18]. Specifically, AVR-Pita1 is found on different chromosomes in different strains, often near telomeres, and sometimes on mini-chromosomes. Understanding AVR effector gene dynamics is key to combating the ability of the blast fungus to rapidly overcome deployed R genes and to developing sustainable disease control.
Wheat blast disease is proving even harder to control than the ancient, still-problematic rice blast disease. Potential wheat resistance identified using strains isolated soon after disease emergence in 1985 are no longer effective in controlling recent aggressive field isolates from wheat in South America and South Asia. The global threat now posed by wheat blast disease makes it critical to generate genomic resources to further understand the wheat blast fungus. Here, a reference genome of an aggressive MoT strain was generated and compared to genomes of early and recent wheat pathogens and other host-adapted strains. We report that the genome structures of the 7 wheat blast core chromosomes have not diverged significantly from the rice blast core chromosomes. However, mini-chromosomes present in zero, one or two copies in different strains serve as a highly variable compartment for effector genes.
Results
A reference genome of the MoT strain B71
We sequenced and generated a near-complete genome assembly of the highly aggressive Bolivian field isolate B71 [4,26], which exhibits high sequence similarity with MoT isolates from Bangladesh (Fig 1A, S1 Table and S1 Fig). An assemblage containing 31 contigs (S2 Table) was produced from >12.4 Gb of whole genome shotgun (WGS) PacBio long reads (S2 Fig). Genome polishing utilizing ~10 Gb Illumina sequencing data corrected 37,982 small insertions and deletions as well as 350 base-pair substitutions in the PacBio draft assembly (S1 Data). Corrected assembled contigs were in the range of 44.2% to 52.5% GC content with the exception of a contig of 28.4%, which was predicted to be from mitochondria of B71 owing to its high similarity (99% identity) to the mitochondrial sequence of M. oryzae rice pathogen 70–15 [27]. A circularized B71 mitochondrial sequence was obtained after removing redundant sequences at the contig ends.
We developed a novel scaffolding technology, LIEP (Long Insert End-Pair sequencing) to improve the continuity of the assembly (Fig 2A). Briefly, LIEP involved construction of millions of vectors, each of which contains a unique DNA barcode pair of 22 nt and 21 nt random barcodes. Barcodes for each vector were sequenced to establish a sequence database of barcode pairs. The vectors were then used to construct clones with 20–30 kb long inserts of B71 genomic DNA flanked by the two vector barcodes. Both ends of the insert were sequenced, generating clone-end sequences with paired barcode sequences. Barcode sequences were used to recover clone-end pairs. All steps were performed with pooled clones rather than individual clones. After scaffolding, a small contig (~12 kb) with the poor support from Illumina reads was discarded. Scaffolding and filtering condensed the assembly to 12 contigs, which were then reoriented and renamed based on the MG8 genome assembly of rice pathogen 70–15 [27]. Consequently, the final B71 genome assembly (B71Ref1) is comprised of ~44.46 Mb in seven chromosomes and five unanchored scaffolds (Fig 2B).
Telomere repeat sequences (TTAGGG)n or M. oryzae telomeric retrotransposons (MoTeRs) that integrate in telomere repeats [28] were identified on both ends of chromosomes 2, 4, 5, 6, 7 and on one end of chromosome 1, indicating that B71Ref1 is a near end-to-end assembly. The B71Ref1 and MG8 assemblies show high end-to-end co-linearity for chromosomes 2, 4, 5, and 7 (Fig 2C, S2 Data). A two-megabase rearrangement was identified between chromosomes 1 and 6, of which part of chromosome 1 of MG8 was located on chromosome 6 of B71. The rearrangement was supported by eight pairs of LIEP sequences (S3 Fig) and by 50 single PacBio long reads. This rearrangement is not MoT specific because it was also observed in a MoO field isolate, evidenced by a long PacBio assembled sequence spanning both chromosome 1 and chromosome 6 of MG8 [29]. A large sequence in B71Ref1, from 1.3 to 2.9 Mb on chromosome 3, was absent in MG8. The unanchored 70–15 MG8 contig, supercont8.8, was mapped at the beginning of B71 chromosome 7, implying supercont8.8 is the missing end of chromosome 7 in the MG8 reference genome. None of five unanchored scaffolds of B71Ref1 can be mapped to MG8, with the requirement of, at minimum, a 10-kb match and 95% identity. Annotation of B71 identified 12,141 genes, with 1,726 harboring signal peptide domains (Fig 3, S3 and S4 Data). Of the 248 highly conserved core set of eukaryotic genes, 243 (98.0%) orthologs from the B71 annotation were identified by CEGMA, compared to 97.6% orthologs in MG8. Therefore, completeness and annotation of the B71 genome are at least comparable to that of MG8, which was produced using Sanger sequencing and multiple technologies.
Comparison of RNA-Seq data of MoT-infected wheat from the field in Bangladesh [3] and culture-grown MoT identified 335 and 153 genes that were only expressed in planta and in culture, respectively (SI Materials and Methods) (S5 Data). Secretion signal domains occurred in 173 in planta-specific genes, and in 18 culture-specific genes. The in planta-specific genes included homologs of five MoO effector genes, including PWL2 and PWL4 (an inappropriately expressed homolog from a weeping lovegrass pathogen that fails to block infection of Eragrotis spp.) [11,12], AVR-Pib and AVRPiz-t that determine rice cultivar specificity [30,31], and the cytoplasmic effector BAS1 [32] (Fig 3G and S5 Data). The remaining 168 in planta-specific genes were considered putative effectors (S6 Data). Both known and putative effector genes tended to be located towards the ends of core chromosomes (Fig 3G). We also generated RNA-Seq data from both B71 in planta leaf samples enriched with fungus at 40 hours post inoculation (HPI) and from B71 grown in liquid medium, which was referred to as the second RNA-Seq experiment. Differential expression analysis identified 2,891 up-regulated genes and 2,429 down-regulated genes of in planta B71 samples as compared to in vitro cultured samples. Considering genes with high fold changes in expression (at least 16x fold-change) between the two groups, we found many more highly up-regulated genes than highly down-regulated genes in planta (863 vs. 44). Of 174 known or putative effector genes, 110 were highly up-regulated at 40 HPI.
Abundant copy number variation among M. oryzae isolates
We sequenced eight additional field isolates, including less-aggressive early strain T25 isolated in Brazil in 1988 [26], five other MoT strains, a MoL strain, and a MoE strain (S1 Fig and S1 Table) [6]. A read depth approach was employed to detect genomic copy number variation (CNV) between B71 and each isolate, focusing on the identification of genomic regions with conserved copy number (CNequal), higher copy number (CNplus), or lower copy number (CNminus) in non-B71 isolates (S4 Fig). Among ~41.7 Mb of low repetitive regions, 36.4 Mb (87.3%) exhibited CNequal among all nine isolates. In total, 4.9 Mb (11.8%) displayed CNV between B71 and at least one other isolate, with 2.7 Mb (6.5%) being CNplus and 3.4 Mb (8.2%) CNminus (Fig 3D, 3E and 3F). Ten effector homologs [9] (PWL4, AVR-Pik-chr3, AVR-Pi54, BAS1-chr1, BAS2, BAS3, BAS4, AVR1-CO39, AVR-Pi9, and AVRPiz-t) resided in CNequal regions (chromosome identifier added to distinguish paralogs). Four (AVR-Pii-chr3, AVR-Pib, PWL2, and BAS1) were in CNminus regions and four (PWT3, AVR-Pii-scaf1, AVR-Pib, and AVR-Pik) in CNplus (S3 Table). CNV analysis of effector genes was supported by Illumina draft assemblies of the eight strains (S4 Table). Sequences from Illumina draft assemblies also showed sequence variation of some effector genes among these strains, such as DNA insertions in PWT3 and AVR-CO39, two AVR genes governing host specificity [13,33,34]. Thus, some AVR homologs are equal in copy number and highly conserved across all strains, while many are subject to sequence changes, including copy number changes. Of 1.2 Mb genomic sequences exhibiting CNplus in some isolates but CNminus in others, ~819 kb (68.5%) were from the five scaffolds (scaf1-5), which constitute only 4.3% of the genome. CNV variation of sequences in the B71 scaffolds indicated they are absent in the less aggressive MoT strain T25 (Fig 4A). The P3 and B71 comparison, however, suggested that most scaffold sequences are duplicated in P3, an aggressive isolate from Paraguay in 2012 (Fig 4B). In summary, extensive copy number variation was observed among M. oryzae field isolates, especially in five scaffolds that were not anchored to the seven chromosomes.
Dispensable mini-chromosomes of MoT strains
Variability in the five scaffolds led us to hypothesize that some or all scaffolds might correspond to mini-chromosome sequences in B71. Electrophoretic karyotypes of B71 using contour-clamped homogeneous electric field (CHEF) electrophoresis confirmed that B71, indeed, contained a mini-chromosome or multiple mini-chromosomes of ~2.0 Mb in size (Fig 4C). Mini- and core chromosomal DNAs were separately excised from the gel for Illumina sequencing. The five scaffolds were highly over-represented among reads obtained from the mini-chromosome DNA and highly under-represented among the core chromosome reads, confirming that all five scaffolds are from the mini-chromosome (Fig 4D). Roughly equal mean depths of B71 WGS reads mapped on all seven core chromosomes or the mini-chromosome supported that B71 contains a mini-chromosome. The mini-chromosome contains 192 protein-coding genes. Of those, 58.9% (113/192) of the genes were expressed (S5 Data). Approximately half expressed genes (N = 56) were highly regulated in expression with at least 16 fold changes comparing 40 HPI in planta samples with in vitro cultured samples, and, significantly, they were all up-regulated in planta, which indicated that genes in the mini-chromosome are likely to be associated with pathogenicity. Of 113 expressed genes, 23 were functionally annotated. Notably, the mini-chromosome contains four of all six genes in the genome that encode plasma membrane fusion proteins, and all four were highly up-regulated in planta at 40 HPI. Three functionally annotated genes exhibited in planta specific expression in the field samples or the B71 in planta leaf sheath samples, namely BSY92_12116, BSY92_11977, and BSY92_12070, encoding endochitinase B1, a gentisate 1,2-dioxygenase, and a heat-labile enterotoxin (a putative effector gene), respectively. A transcriptional regulatory gene, an Sge1 homologous gene (BSY92_12088), governing expression of secondary metabolite biosynthetic genes [35] was highly up-regulated in planta. Most other functionally annotated expressed genes are associated with putative enzymatic activities. A gene BSY92_11993 encoding ubiquitin-like-specific protease 2 was expressed in both in planta and in vitro cultured samples, but it was highly up-regulated in planta. Gene ontology (GO) enrichment analysis identified that cysteine-type peptidase activity (GO:0008234, p-value = 0.0001) was over-represented in genes on the mini-chromosome (S7 Data). Eight out of all 11 genes associated with cysteine-type peptidase activity are located on the mini-chromosome, and 7 out of these 8 were expressed in either in planta or in vitro cultured samples.
Known effector genes PWL2 and BAS1 (S5 and S6 Figs), which are located on different core chromosomes in MG8, were located immediately adjacent to one another and surrounded by various transposon sequences on the B71 mini-chromosome (Fig 5A). This configuration was supported by 211 PacBio long reads and by Sanger sequencing of a PCR product obtained with a PWL2 and BAS1 primer pair (S7 Fig). No PWL2 or BAS1 homologs, with at least 70% identity, were identified on core chromosomes, supported by an under-represented sequencing coverage on the PWL2 or BAS1 regions from CHEF sequencing of B71 core chromosomes (Fig 5D). Both genes exhibited in planta-specific expression on the mini-chromosome (Fig 5B and S8 Fig). Therefore, mini-chromosomes harbor effector genes that show similar in planta-specific expression patterns to effector genes residing on core chromosomes.
Further CHEF analyses showed no evidence of mini-chromosomes in T25 and supported at least two mini-chromosomes in P3, consistent with predictions from the CNV results. The P3 mini-chromosomes are ~1.5 Mb and ~3 Mb in length (Fig 4C). Sequences of both P3 mini-chromosomes exhibited similarities to the B71 mini-chromosome but also marked differences (Fig 4D and Fig 5E). The large P3 mini-chromosome contained both PWL2 and BAS1 genes (Fig 5C and 5D), plus it harbored ~33 kb (assembly location 6,007 to 6,039 kb) of duplicated DNA from a region near the end of chromosome 6. This duplicated DNA segment included a homolog of the MoO effector AVR-Pib [30]. In contrast, the small P3 mini-chromosome lacked the PWL2 and BAS1 genes, but it contained a duplication of approximately 0.39 Mb of the chromosome 7 end (assembly location ~3.65 to 4.04 Mb) (Fig 5D). Retention of this segment in the core chromosome explains the large CNplus segment at this region of P3 chromosome 7 (Fig 4B). The CNV result indicated that both sequences of ends of chromosome 6 and chromosome 7 found in separated mini-chromosomes have only one extra copy, supporting that P3 mostly likely has no more than two mini-chromosomes. Notably, this segment contained five putative effector genes and a homolog of the known MoO effector gene AVR-Pik [36]. Another notable region from the end of chromosome 3 was present in both P3 mini-chromosomes, but not present in the B71 mini-chromosome (Fig 4D). Sequencing P3 core chromosomes identified sequences homologous to the B71 mini-chromosome that were not present in B71 core chromosomes (Fig 4D). Taken together, these three MoT mini-chromosomes contain different sets of known or predicted effector genes and other core-chromosome end sequences, which are either missing or duplicated on the core chromosomes of the same or other strains. The highly variable structure of MoT mini-chromosomes indicates frequent acquisition of sequences from core chromosomal ends.
Repetitive sequences in B71 core- and mini-chromosomes
Repeat annotation showed approximately 12.9% of the B71 genome consisted of transposons and other repetitive elements, and transposons accounted for 9.7% and 52.8% of the core and mini-chromosomes, respectively (Fig 6A and S5 Table). Many of the transposons that were over-represented in the mini-chromosome occurred frequently on chromosome arms, particularly at chromosome ends (S9 Fig). Four transposon subclasses made up a greater proportion of the total transposon sequences on the mini-chromosome versus core chromosomes, including three LINEs (Tad1, Jockey and I) and the DNA transposon TcMar-Fot1 (Fig 6B). These four are among the top five elements enriched in the core chromosomal 20% ends relative to the 20% middle core chromosome regions (Fig 6C). Besides similarities in transposon composition between chromosome ends and the mini-chromosome, alignment of the B71 mini-chromosome sequence to core chromosomes identified duplications of >10 kb fragments with at least 95% identity. Duplications were located at ends of chromosomes 3, 4, and 7 (S9 Fig), and they were highly enriched for telomere-associated MoTeRs (LINE/CRE element). Therefore, a subset of MoT transposons is implicated in dynamic interactions between MoT mini-chromosomes and core chromosome ends.
Nucleotide composition analysis indicated that, overall, repetitive sequences along core chromosomes were highly negatively correlated with GC content (Fig 3B and 3C, S10 Fig). However, the highly negative correlation did not hold in the mini-chromosome, which is highly repetitive while maintaining relatively high GC content (S10 Fig). Repetitive sequences in many fungi, including M. oryzae MoT strains, are subject to repeat-induced point (RIP) mutation resulting in C-to-T or G-to-A transitions and, thereby, leading to reduced GC content [37–40]. Given higher GC content of repetitive sequences in the mini-chromosome versus core chromosomes, we explored the possibility of different levels of RIP in these genomic regions by assessing their RIP-type mutation rates. Of six high-abundance transposons examined, all exhibited reduced levels of RIP-type mutations in the mini-chromosome relative to core chromosomes (S11 Fig). We examined transposons MGR583 (LINE/Tad1 element) and Pot2 (DNA/TcMar-Fot1 element) that are present with multiple copies in both core and mini-chromosomes. RIP analysis indicated that no sequences of MGR583 (N = 7) or Pot2 (N = 22) from the mini-chromosome were subjected to extensive RIP-type mutations, while 14/20 MGR583 and 3/19 Pot2 from core chromosomes contained abundant RIP-type mutations (Fig 6D and S12 Fig). Therefore, unlike transposons in core chromosomes, transposons in MoT mini-chromosomes do not appear to be inactivated by the RIP genome defense mechanism.
Discussion
The B71 reference genome for the wheat blast fungus has shown a high degree of macrosynteny for the core chromosomes relative to the rice pathogen reference genome 70–15 (MG8), which supports the recent report maintaining M. oryzae as a single species [6]. In contrast, mini-chromosomes present in B71 and another recent MoT field isolate P3 (P3-large and P3-small mini-chromosomes) are highly variable, with each one containing shared and different MoO effector homologs, putative effector genes, and other sequences from core chromosome ends. The B71 and P3-large mini-chromosomes contain the only copies of known MoO effectors PWL2 and BAS1 in these strains and neither gene was present in the early MoT strain T25, which we show lacks mini-chromosomes. PWL2 and BAS1 are located on different core chromosomes in 70–15, but they are found side-by-side on the B71 mini-chromosome. Both effectors show similar in planta specific expression on the MoT mini-chromosomes and on the MoO core chromosomes. Only the P3-large mini-chromosome contains a homolog of the MoO AVR-Pib gene, and only the P3-small mini-chromosome contains a homolog of AVR-Pik. Each mini-chromosome contains many other sequences that are either duplicated from core chromosome ends or missing from core chromosomes altogether. In one case, a P3 core chromosome sequence was homologous to the B71 mini-chromosome but not present in B71 core chromosomes. Taken together, our findings provide new insight on the M. oryzae two-speed genome [15] previously known to involve effector localization in transposon-rich regions near chromosome ends. We expand understanding of this effector compartment to include two apparently interchangeable regions, non-dispensable core chromosome ends coupled to dispensable mini-chromosomes.
We show that the M. oryzae accessory mini-chromosomes have a unique set of properties relative to accessory chromosomes in other fungi, including the well-studied accessory chromosomes in Fusarium species [41,42] and in Zymoseptoria tritici (syn. Mycosphaerella graminicola) [43,44]. M. oryzae mini-chromosomes, like lineage-specific chromosomes in Fusarium spp. [41,42] and the mini-chromosome in Leptosphaeria maculans [45] contain multiple genes associated with virulence and host-specificity. However, extensive recombination with core chromosomes has been so far only observed in M. oryzae mini-chromosomes. The rich set of accessory chromosomes in Z. tritici lack genes with an obvious role in pathogenicity, although some contribute quantitative pathogenicity effects in some strains [46]. The Z. tritici accessory chromosomes appear to be relatively ancient based on apparent survival through at least one speciation event [43,46]. M. oryzae mini-chromosomes resemble the accessory chromosomes of F. poae in lacking signs of the fungal specific genome defense mechanism known as RIP [47], therefore differing from the mini-chromosome and AT-isochore regions of L. maculans for which RIP appears to be a major mechanism for effector gene mutation during response to R gene deployment [48,49]. The gene and transposable element crosstalk between the core and supernumerary genomes reported in F. poae does not preferentially involve effectors and core chromosomes ends such as we report for M. oryzae [47]. Although supernumerary chromosomes in many systems appear heterochromatic, with low levels of gene expression [21], effector genes in M. oryzae mini-chromosomes show in planta specific expression characteristic of these genes on core chromosomes. Therefore, mini-chromosomes in the wheat blast pathogen differ in degree of variability of effector gene content and extent of recombination with core chromosome ends compared to dispensable chromosomes characterized so far in other fungi.
The mechanism for sequence exchange between core- and mini-chromosomes is unknown. However, the enrichment in mini-chromosomes of multiple subclasses of LINE retro-transposons and a DNA transposon that are also enriched at core chromosome ends, points to a transposon-mediated recombination mechanism involving non-allelic homology. Such a mechanism has been shown to facilitate genome rearrangements in another phytopathogenic fungus [50]. In contrast to seemingly RIPed core chromosome copies, the multiple copies of both MGR583 (LINE element) and Pot2 (DNA element) in the mini-chromosomes are nearly devoid of RIP-type mutations. This suggests that transposons on the mini-chromosomes remain active, facilitating multiplication and recombination. Telomere-associated MoTeR elements, found in MoL strains but not in MoO strains, are present on MoT mini-chromosomes. MoTeR elements have been reported to account for the extreme sequence variability of MoL telomeres compared to MoO telomeres [28], suggesting these elements might enhance mini-chromosome dynamics in MoT and MoL strains through destabilization of telomere regions. Transposon-rich genomic regions have been linked to increased sequence and structural variation in fungal plant pathogens [15,25,46]. Therefore, transposon-rich mini-chromosomes that also carry a number of genes, including many putative effectors, likely serve as genomic hotspots promoting genomic variation. Exceptional genomic variation produced in mini-chromosomes, and capable of flowing into core chromosomes, could accelerate the evolutionary potential of the pathogen.
Dynamic interchange between mini-chromosomes and core chromosome ends would contribute to AVR-Pita1 effector gene mobility, which is especially characteristic of rice pathogens [18]. M. oryzae rice pathogens are notorious for their ability to rapidly overcome deployed R genes. AVR-Pita1 and AVR-Pita2, which each confer avirulence to rice carrying the corresponding Pita resistance gene, belong to a subtelomeric gene family (S4 Table) and show a high rate of spontaneous mutations, including frequent deletions [51,52]. AVR-Pita1 and AVR-Pita2 occur in zero, one or more copies in different M. oryzae isolates and show highly variable genomic locations, usually near ends on core chromosomes 1, 3, 5, 6, 7; in 3 separate locations on chromosome 4; and on supernumerary chromosomes [18]. In contrast, avr-pita3, which lacks AVR activity, is stably located on chromosome 7 across the host-adapted lineages of M. oryzae. Therefore, extremely high genomic mobility, particularly of AVR-Pita1, appears to be a response to the periodic deployment of the Pita gene in rice. Mini-chromosomes would provide a population-wide repository for AVR genes that are deleted from individual strains and a means for rapid loss of AVR gene function from individual strains, because mini-chromosomes are frequently lost during meiosis and mitosis [22–24,53]. Individual strains lacking AVR-Pita1 could regain it through acquiring AVR-Pita1 containing mini-chromosomes from other individuals through the parasexual cycle and lateral gene transfer [18]. This would explain how the gene became integrated into new locations on the core chromosomes, typically at chromosome ends. The dynamic coupling we report between mini-chromosomes and core chromosome ends supports the multiple translocation hypothesis for AVR genes responding to periodic negative selection pressure of R gene deployment. Collectively, we propose that the mini-chromosome plays a role for gene movements like a shuttle, in which mutation, duplication, loss, and rearrangements of DNA occur at a faster pace than normal genomic changes, hence, accelerating genomic evolution for adaptation.
Growing evidence suggests that avirulence-conferring PWL family members (S4 Table) may be undergoing multiple translocation similarly to AVR-Pita family members [18]. PWL2 from a rice isolate and PWL1 from an Eleusine isolate each confer avirulence toward Eragrostis spp. [11,12]. The well-studied PWL2 gene, like AVR-Pita1, occurs in zero to four copies in different strains and is subject to frequent spontaneous deletion [12]. Genetic analyses showed that PWL2 and PWL1 map to different chromosomal locations, with PWL1 linked to a telomere. Homology between the PWL2 and PWL1 genes begins 70 bp upstream of the PWL1 initiation codon and ends immediately after the stop codon, and sequences beyond this conserved region are completely unrelated. In contrast, the apparently allelic, non-AVR conferring PWL3 and PWL4 genes mapped to a third genomic region and share conserved flanking sequences [11]. Two copies of PWL2 are present on chromosomes 3 and 6 in the reference rice genome MG8, the intact PWL2 sequence was found in three assembled contigs of a highly aggressive rice isolate 98–06 [54], and we report that PWL2 resides on a mini-chromosome in some wheat pathogens. Further research is needed to track chromosomal dynamics of PWL2, as well as PWL1, in host-adapted forms of M. oryzae. AVR-Pita1 effector gene mobility is reported to be in response to periodic deployment of the corresponding Pita gene in rice, raising the question of comparable selection pressure that might be acting in the Eragrostis system. Introduction of weeping lovegrass, native to South Africa, and other Eragrostis spp., around the world for forage and erosion control in the past decades could have provided conditions promoting loss and recovery of PWL family members.
Our results will inspire further exploration of function and evolutionary roles of mini-chromosomes in the fungal phytopathosystem, and facilitate answering important questions for blast on wheat and other cereal crops. Our early MoT strain T25, isolated in 1988, lacks mini-chromosomes, as was previously reported for 7 other MoT strains isolated in Brazil between 1986 and 1988 [23]. This raises the question of whether mini-chromosomes have contributed in any way to the enhanced aggressiveness characteristic of recent field isolates such as B71 and P3. It is critical to monitor further evolution, including potential recombination with other M. oryzae pathotypes, of the complex MoT population in South America and the initially clonal MoT population in South Asia [1,4]. Localization of the PWL2 host species-specificity gene on mini-chromosomes in wheat pathogens raises the question of a role for mini-chromosomes in host jumps. Effector gene dynamics, so far only associated with a small number of MoO AVR effector genes corresponding to periodically deployed R genes, raises the question of what roles known MoO AVR effector homologs and BAS1 (lacking known AVR activity in MoO strains) play in wheat infection by MoT strains. Finally, it is critical to identify and deploy effective wheat blast resistance.
Materials and methods
Detailed description of materials and methods is included in S1 Text.
Genetic materials
All M. oryzae strains examined were field strains from South America (S1 Table). MoT isolates B71, T25, and P3 were isolated in Bolivia (2012), Brazil (1988), and Paraguay (2012), respectively. All work with living wheat blast fungus in the U.S. was performed with proper USDA-APHIS permits and monitoring in BSL-3 laboratories in the Biosecurity Research Institute at Kansas State University.
DNA extraction
Single spore isolates of each pathogen strain were cultured in complete medium for mycelium propagation. Mycelium was harvested and frozen using liquid nitrogen. To avoid excessive mitochondrial DNA, mycelial nuclei were collected by gradient centrifugation as described [55]. The CTAB (cetyltrimethylammonium bromide) DNA extraction method was applied to isolate genomic DNA from the nuclear samples [56].
B71 genome sequencing and assembly
The 3–20 kb WGS libraries were constructed using B71 nuclear genomic DNAs. The library was sequenced with P6-C4 chemistry on ten SMRTcells of PacBio RS II. Nuclear genomic DNAs were also subjected to 2x250 bp paired-end Illumina sequencing. To increase the assembly continuity, LIEP was devised and used to generate 20–30 kb long-distance paired sequences for scaffolding. PacBio long reads were assembled using the Canu pipeline [57]. Self-correction using PacBio reads did not correct all PacBio sequencing errors. Illumina reads and the Illumina assembly sequences assembled using DISCOVAR de novo [58] were both utilized for further error correction. The resulting assembled contigs were scaffolded using LIEP long-distance paired sequences with the software SSPACE [59].
The first and second RNA-Seq experiments
Two RNA-Seq experiments were performed. In the first RNA-Seq experiment, an in vitro cultured mycelium sample was collected for the total RNA extraction using RNeasy Plant Mini Kit. Total RNA was used for RNA sequencing on a MiSeq to generate 2x150bp paired-end data. Clean data after adaptor and quality trimming were de novo assembled using Trinity [60], which were then aided in genome annotation.
In the second RNA-Seq experiment, we attempted to compare B71 gene expression in planta and in vitro culture with three biological replicates in each group. RNAs of in planta samples were isolated from B71-infected epidermal cells of leaf sheaths from 3–4 weeks old wheat plants at 40 HPI. The B71 in vitro culture RNAs were extracted from mycelium grown in liquid swirling cultures with minor modifications to the method of Mosquera et al. [32]. The total RNAs were subjected to library preparation for mRNA sequencing to produce single-end 75bp reads. Clean data after adaptor and quality trimming were aligned to the B71Ref1 reference genome with STAR [61]. Read counts per genes were used for differential expression analysis with DESeq2 [62] with 1% false discovery rate (FDR) as the threshold to declare significantly differentially expressed genes between in planta and in vitro culture groups [63].
Genome annotation
A Maker pipeline was used for the B71 genome annotation [64]. Both evidence-driven prediction and ab initio gene prediction were employed [65]. Transcriptional evidence was provided using assembled sequences from RNA sequencing data of the B71 strain that was cultured in media and field wheat leaf samples infected by Bangladesh wheat blast strains, which were genetically almost identical to B71. CEGMA was used to assess the completeness of the genome assembly or annotation [66].
Identification of expressed genes and putative effectors
Publically available RNA-Seq data of MoT infected wheat were used as in planta expression data to compare with in vitro culture RNA-Seq data from the first RNA-Seq experiment. Field RNA-Seq data includes samples 5 and 7 from Bangladesh wheat fields [3]. These MoT isolates have been demonstrated to be almost identical to B71. All data from samples 5 and 7 were merged to represent field in planta transcriptomes. Genes with read abundance higher than 0.1 FPKM (fragment per kilobase of coding sequence per million reads) in either in planta or in culture samples were considered to be expressed genes. Genes with read abundance higher than 1 FPKM from the in planta data set but no reads from the cultured sample were considered to be in planta specific expression. In planta specific genes containing classical signal peptide domains [67] were considered putative effectors.
Analysis of copy number variation between strains
Read depth approach was employed to identify CNV between each of some M. oryzae strains and B71 for each of sequence bins (e.g, 300 bp). Segmentation with the R package of DNACopy was performed to identify genomic CNV segments merged from multiple bins [68].
CHEF karyotypes of MoT strains and mini-chromosome sequencing
MoT protoplasts were prepared and mixed with 1.5% low melting-temperature agarose [23]. Suspensions were loaded into disposable plug molds. Protoplasts in plugs were lysed with proteinase K and washed. A Biorad CHEF electrophoresis system was used for separating chromosomes embedded in the plugs. After the CHEF gel electrophoresis, DNAs from individual mini-chromosomes, and from core chromosomes as one unit, were excised and purified from the agarose gels. Purified DNAs were subjected to Illumina 2x151 bp paired-end sequencing.
Analyses of repetitive sequences
Repetitive sequences were identified using MGEScan [69], LTR_Finder [70], LTRharvest [71,72], and RepeatModeler (github.com/rmhubley/RepeatModeler). Merging discovered repetitive sequences and previously characterized M. oryzae repeats [73] produced a non-redundant database, which served as a repeat library to identify repeats in the B71 genome using RepeatMasker (www.repeatmasker.org). Some transposable elements were subjected to analysis of RIP-type polymorphisms, nucleotide changes of C to T or G to A.
Supporting information
Acknowledgments
We thank Drs. Nicholas J Talbot and Yukio Tosa for comments prior to manuscript submission and thank Dr. Alina Akhunova at the Integrated Genomics Facility at Kansas State University for technical support. Photos in Fig 1B and 1C are from Guillermo Isidoro Barea Vargas, Coperagro SRL, Bolivia. The photo in Fig 1D is from Javier Toledo, Agripac Boliviana, Bolivia. This is the contribution number 18-336-J from the Kansas Agricultural Experiment Station.
Data Availability
Sequencing raw data are available from Sequence Read Archive: PRJNA355407. Briefly, PacBio WGS sequencing raw data were deposited at SRR6232283 to SRR6232292. Illumina WGS sequencing raw data were deposited at SRR6232156. LIEP data were deposited at SRR6690042 and SRR6690043. The first cultured RNA-Seq reads were deposited at SRR9126640, and the second RNA-Seq reads were deposited at SRR9127597 to SRR9127602. The Genbank accession number for the B71 genome assembly is MQUD00000000. Custom scripts used in the manuscript are available at Github (https://github.com/liu3zhenlab/manuscripts).
Funding Statement
Awards 2013-68004-20378 (BV) and 2018-67013-28511 (SL) were funded from the USDA National Institute of Food and Agriculture. Award IOS-1741090 (SL) was funded by the Plant Genome Research Program from the US National Science Foundation. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Cruz CD, Valent B (2017) Wheat blast disease: danger on the move. Tropical Plant Pathology 42: 210–222. [Google Scholar]
- 2.Kohli MM, Mehta YR, Guzman E, Viedma L, Cubilla LE (2011) Pyricularia blast—a threat to wheat cultivation. Czech Journal of Genetics and Plant Breeding 47: S130–S134. [Google Scholar]
- 3.Islam MT, Croll D, Gladieux P, Soanes DM, Persoons A, et al. (2016) Emergence of wheat blast in Bangladesh was caused by a South American lineage of Magnaporthe oryzae. BMC Biology 14: 84 10.1186/s12915-016-0309-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Malaker PK, Barma NCD, Tiwari TP, W.J.Collis WJ, Duveiller E, et al. (2016) First report of wheat blast caused by Magnaporthe oryzae pathotype triticum in Bangladesh. Plant Disease 100: 2330 [Google Scholar]
- 5.Mottaleb KA, Singh PK, Sonder K, Kruseman G, Tiwari TP, et al. (2018) Threat of wheat blast to South Asia’s food security: An ex-ante analysis. PLoS ONE 13: e0197555 10.1371/journal.pone.0197555 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Gladieux P, Condon B, Ravel S, Soanes D, Nunes Maciel JL, et al. (2018) Gene flow between divergent cereal- and grass-specific lineages of the rice blast fungus Magnaporthe oryzae. mBio 9:e01219–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Farman M, Peterson GL, Chen L, Starnes JH, Valent B, et al. (2017) The Lolium pathotype of Magnaporthe oryzae recovered from a single blasted wheat plant in the United States. Plant Disease 101: 684–692. 10.1094/PDIS-05-16-0700-RE [DOI] [PubMed] [Google Scholar]
- 8.Giraldo MC, Valent B (2013) Filamentous plant pathogen effectors in action. Nature Reviews Microbiology 11: 800–814. 10.1038/nrmicro3119 [DOI] [PubMed] [Google Scholar]
- 9.Wang B, Ebbole DJ, Wang Z (2017) The arms race between Magnaporthe oryzae and rice: diversity and interaction of Avr and R genes. Journal of Integrative Agriculture 16: 2746–2760. [Google Scholar]
- 10.Liu WD, Liu JL, Triplett L, Leach JE, Wang GL (2014) Novel insights into rice innate immunity against bacterial and fungal pathogens. Annual Review of Phytopathology 52: 213–241. 10.1146/annurev-phyto-102313-045926 [DOI] [PubMed] [Google Scholar]
- 11.Kang S, Sweigard JA, Valent B (1995) The PWL host specificity gene family in the blast fungus Magnaporthe grisea. Molecular Plant Microbe Interactions 8: 939–948. [DOI] [PubMed] [Google Scholar]
- 12.Sweigard JA, Carroll AM, Kang S, Farrall L, Chumley FG, et al. (1995) Identification, cloning, and characterization of Pwl2, a gene for host species-specificity in the rice blast fungus. The Plant Cell 7: 1221–1233. 10.1105/tpc.7.8.1221 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Inoue Y, Vy TTP, Yoshida K, Asano H, Mitsuoka C, et al. (2017) Evolution of the wheat blast fungus through functional losses in a host specificity determinant. Science 357: 80–83. 10.1126/science.aam9654 [DOI] [PubMed] [Google Scholar]
- 14.Kroj T, Chanclud E, Michel‐Romiti C, Grand X, Morel JB (2016) Integration of decoy domains derived from protein targets of pathogen effectors into plant immune receptors is widespread. New Phytologist 210: 618–626. 10.1111/nph.13869 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Raffaele S, Kamoun S (2012) Genome evolution in filamentous plant pathogens: why bigger can be better. Nature Reviews Microbiology 10: 417–430. 10.1038/nrmicro2790 [DOI] [PubMed] [Google Scholar]
- 16.Bertazzoni S, Williams AH, Jones DA, Syme RA, Tan KC, et al. (2018) Accessories make the outfit: Accessory chromosomes and other dispensable DNA regions in plant-pathogenic fungi. Molecular Plant Microbe Interactions 31: 779–788. 10.1094/MPMI-06-17-0135-FI [DOI] [PubMed] [Google Scholar]
- 17.Valent B, Khang CH (2010) Recent advances in rice blast effector research. Current Opinion in Plant Biology 13: 434–441. 10.1016/j.pbi.2010.04.012 [DOI] [PubMed] [Google Scholar]
- 18.Chuma I, Isobe C, Hotta Y, Ibaragi K, Futamata N, et al. (2011) Multiple translocation of the AVR-Pita effector gene among chromosomes of the rice blast fungus Magnaporthe oryzae and related species. PLoS Pathogens 7: e1002147 10.1371/journal.ppat.1002147 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Luo CX, Yin LF, Ohtaka K, Kusaba M (2007) The 1.6Mb chromosome carrying the avirulence gene AvrPik in Magnaporthe oryzae isolate 84R-62B is a chimera containing chromosome 1 sequences. Mycological Research 111: 232–239. 10.1016/j.mycres.2006.10.008 [DOI] [PubMed] [Google Scholar]
- 20.Mehrabi R, Mirzadi Gohari A, Kema GHJ (2017) Karyotype variability in plant-pathogenic fungi. Annual Review of Phytopathology 55: 483–503. 10.1146/annurev-phyto-080615-095928 [DOI] [PubMed] [Google Scholar]
- 21.Soyer JL, Balesdent M-H, Rouxel T, Dean RA (2018) To B or not to B: a tale of unorthodox chromosomes. Current Opinion in Microbiology 46: 50–57. 10.1016/j.mib.2018.01.012 [DOI] [PubMed] [Google Scholar]
- 22.Chuma I, Tosa Y, Taga M, Nakayashiki H, Mayama S (2003) Meiotic behavior of a supernumerary chromosome in Magnaporthe oryzae. Current Genetics 43: 191–198. 10.1007/s00294-003-0390-7 [DOI] [PubMed] [Google Scholar]
- 23.Orbach MJ, Chumley FG, Valent B (1996) Electrophoretic karyotypes of Magnaporthe grisea pathogens of diverse grasses. Molecular Plant Microbe Interactions 9: 261–271. [Google Scholar]
- 24.Talbot NJ, Salch YP, Ma M, Hamer JE (1993) Karyotypic variability within clonal lineages of the rice blast fungus, Magnaporthe grisea. Applied and Environmental Microbiology 59: 585–593. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Yoshida K, Saunders DG, Mitsuoka C, Natsume S, Kosugi S, et al. (2016) Host specialization of the blast fungus Magnaporthe oryzae is associated with dynamic gain and loss of genes linked to transposable elements. BMC Genomics 17: 370 10.1186/s12864-016-2690-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Cruz CD, Peterson GL, Bockus WW, Kankanala P, Dubcovsky J, et al. (2016) The 2NS translocation from Aegilops ventricosa confers resistance to the Triticum Pathotype of Magnaporthe oryzae. Crop Science 56: 990–1000. 10.2135/cropsci2015.07.0410 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Dean RA, Talbot NJ, Ebbole DJ, Farman ML, Mitchell TK, et al. (2005) The genome sequence of the rice blast fungus Magnaporthe grisea. Nature 434: 980–986. 10.1038/nature03449 [DOI] [PubMed] [Google Scholar]
- 28.Starnes JH, Thornbury DW, Novikova OS, Rehmeyer CJ, Farman ML (2012) Telomere-targeted retrotransposons in the rice blast fungus Magnaporthe oryzae: agents of telomere instability. Genetics 191: 389–406. 10.1534/genetics.111.137950 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Bao J, Chen M, Zhong Z, Tang W, Lin L, et al. (2017) PacBio sequencing reveals transposable element as a key contributor to genomic plasticity and virulence variation in Magnaporthe oryzae. Molecular Plant 10: 1465–1468. 10.1016/j.molp.2017.08.008 [DOI] [PubMed] [Google Scholar]
- 30.Zhang S, Wang L, Wu W, He L, Yang X, et al. (2015) Function and evolution of Magnaporthe oryzae avirulence gene AvrPib responding to the rice blast resistance gene Pib. Scientific Reports 5: 11642 10.1038/srep11642 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Li W, Wang B, Wu J, Lu G, Hu Y, et al. (2009) The Magnaporthe oryzae avirulence gene AvrPiz-t encodes a predicted secreted protein that triggers the immunity in rice mediated by the blast resistance gene Piz-t. Molecular Plant Microbe Interactions 22: 411–420. 10.1094/MPMI-22-4-0411 [DOI] [PubMed] [Google Scholar]
- 32.Mosquera G, Giraldo MC, Khang CH, Coughlan S, Valent B (2009) Interaction transcriptome analysis identifies Magnaporthe oryzae BAS1-4 as biotrophy-associated secreted proteins in rice blast disease. Plant Cell 21: 1273–1290. 10.1105/tpc.107.055228 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Tosa Y, Osue J, Eto Y, Oh HS, Nakayashiki H, et al. (2005) Evolution of an avirulence gene, AVR1-CO39, concomitant with the evolution and differentiation of Magnaporthe oryzae. Mol Plant Microbe Interact 18: 1148–1160. 10.1094/MPMI-18-1148 [DOI] [PubMed] [Google Scholar]
- 34.Zheng Y, Zheng W, Lin F, Zhang Y, Yi Y, et al. (2011) AVR1-CO39 is a predominant locus governing the broad avirulence of Magnaporthe oryzae 2539 on cultivated rice (Oryza sativa L.). Mol Plant Microbe Interact 24: 13–17. 10.1094/MPMI-10-09-0240 [DOI] [PubMed] [Google Scholar]
- 35.Brown DW, Busman M, Proctor RH (2014) Fusarium verticillioides SGE1 is required for full virulence and regulates expression of protein effector and secondary metabolite biosynthetic genes. Mol Plant Microbe Interact 27: 809–823. 10.1094/MPMI-09-13-0281-R [DOI] [PubMed] [Google Scholar]
- 36.Kanzaki H, Yoshida K, Saitoh H, Fujisaki K, Hirabuchi A, et al. (2012) Arms race co-evolution of Magnaporthe oryzae AVR-Pik and rice Pik genes driven by their physical interactions. Plant Journal 72: 894–907. 10.1111/j.1365-313X.2012.05110.x [DOI] [PubMed] [Google Scholar]
- 37.Gladyshev E (2017) Repeat-induced point mutation and other genome defense mechanisms in fungi. Microbiology Spectrum 5: 10.1128/microbiolspec.FUNK-0042-2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Ikeda K, Nakayashiki H, Kataoka T, Tamba H, Hashimoto Y, et al. (2002) Repeat-induced point mutation (RIP) in Magnaporthe grisea: implications for its sexual cycle in the natural field context. Molecular Microbiology 45: 1355–1364. 10.1046/j.1365-2958.2002.03101.x [DOI] [PubMed] [Google Scholar]
- 39.Nakayashiki H, Nishimoto N, Ikeda K, Tosa Y, Mayama S (1999) Degenerate MAGGY elements in a subgroup of Pyricularia grisea: a possible example of successful capture of a genetic invader by a fungal genome. Molecular and General Genetics 261: 958–966. 10.1007/s004380051044 [DOI] [PubMed] [Google Scholar]
- 40.Hane JK, Oliver RP (2008) RIPCAL: a tool for alignment-based analysis of repeat-induced point mutations in fungal genomic sequences. BMC Bioinformatics 9: 478 10.1186/1471-2105-9-478 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Coleman JJ, Rounsley SD, Rodriguez-Carres M, Kuo A, Wasmann CC, et al. (2009) The genome of Nectria haematococca: contribution of supernumerary chromosomes to gene expansion. PLoS Genetics 5: e1000618 10.1371/journal.pgen.1000618 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Ma L-J, van der Does HC, Borkovich KA, Coleman JJ, Daboussi M-J, et al. (2010) Comparative genomics reveals mobile pathogenicity chromosomes in Fusarium. Nature 464: 367–373. 10.1038/nature08850 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Croll D, Zala M, McDonald BA (2013) Breakage-fusion-bridge cycles and large insertions contribute to the rapid evolution of accessory chromosomes in a fungal pathogen. PLoS Genetics 9: 20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Goodwin SB, Ben M'Barek S, Dhillon B, Wittenberg AHJ, Crane CF, et al. (2011) Finished genome of the fungal wheat pathogen Mycosphaerella graminicola reveals dispensome structure, chromosome plasticity, and stealth pathogenesis. PLoS Genetics 7: e1002070 10.1371/journal.pgen.1002070 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Balesdent MH, Fudal I, Ollivier B, Bally P, Grandaubert J, et al. (2013) The dispensable chromosome of Leptosphaeria maculans shelters an effector gene conferring avirulence towards Brassica rapa. New Phytol 198: 887–898. 10.1111/nph.12178 [DOI] [PubMed] [Google Scholar]
- 46.Moller M, Stukenbrock EH (2017) Evolution and genome architecture in fungal plant pathogens. Nature Reviews Microbiology 15: 756–771. 10.1038/nrmicro.2017.76 [DOI] [PubMed] [Google Scholar]
- 47.Vanheule A, Audenaert K, Warris S, van de Geest H, Schijlen E, et al. (2016) Living apart together: crosstalk between the core and supernumerary genomes in a fungal plant pathogen. BMC Genomics 17: 670 10.1186/s12864-016-2941-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Daverdin G, Rouxel T, Gout L, Aubertot JN, Fudal I, et al. (2012) Genome structure and reproductive behaviour influence the evolutionary potential of a fungal phytopathogen. PLoS Pathogens 8: 15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Rouxel T, Grandaubert J, Hane JK, Hoede C, van de Wouw AP, et al. (2011) Effector diversification within compartments of the Leptosphaeria maculans genome affected by Repeat-Induced Point mutations. Nature Communications 2: 202 10.1038/ncomms1189 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Faino L, Seidl MF, Shi-Kunne X, Pauper M, van den Berg GC, et al. (2016) Transposons passively and actively contribute to evolution of the two-speed genome of a fungal pathogen. Genome Research 26: 1091–1100. 10.1101/gr.204974.116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Khang CH, Park SY, Lee YH, Valent B, Kang S (2008) Genome organization and evolution of the AVR-Pita avirulence gene family in the Magnaporthe grisea species complex. Molecular Plant Microbe Interactions 21: 658–670. 10.1094/MPMI-21-5-0658 [DOI] [PubMed] [Google Scholar]
- 52.Orbach MJ, Farrall L, Sweigard JA, Chumley FG, Valent B (2000) A telomeric avirulence gene AVR-Pita determines efficacy for the rice blast resistance gene Pi-ta. Plant Cell 12: 2019–2032. 10.1105/tpc.12.11.2019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Habig M, Kema GH, Holtgrewe Stukenbrock E (2018) Meiotic drive of female-inherited supernumerary chromosomes in a pathogenic fungus. Elife 7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Dong Y, Li Y, Zhao M, Jing M, Liu X, et al. (2015) Global genome and transcriptome analyses of Magnaporthe oryzae epidemic isolate 98–06 uncover novel effectors and pathogenicity-related genes, revealing gene gain and lose dynamics in genome evolution. PLoS Pathog 11: e1004801 10.1371/journal.ppat.1004801 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Zhang MP, Zhang Y, Scheuring CF, Wu CC, Dong JJ, et al. (2012) Preparation of megabase-sized DNA from a variety of organisms using the nuclei method for advanced genomics research. Nature Protocols 7: 467–478. 10.1038/nprot.2011.455 [DOI] [PubMed] [Google Scholar]
- 56.Clarke JD (2009) Cetyltrimethyl ammonium bromide (CTAB) DNA miniprep for plant DNA isolation. Cold Spring Harbor Protocols 2009: pdb.prot5177. [DOI] [PubMed] [Google Scholar]
- 57.Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, et al. (2017) Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Research 27: 722–736. 10.1101/gr.215087.116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Weisenfeld NI, Yin S, Sharpe T, Lau B, Hegarty R, et al. (2014) Comprehensive variation discovery in single human genomes. Nature Genetics 46: 1350–1355. 10.1038/ng.3121 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Boetzer M, Henkel CV, Jansen HJ, Butler D, Pirovano W (2011) Scaffolding pre-assembled contigs using SSPACE. Bioinformatics 27: 578–579. 10.1093/bioinformatics/btq683 [DOI] [PubMed] [Google Scholar]
- 60.Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, et al. (2011) Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nature Biotechnology 29: 644–U130. 10.1038/nbt.1883 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, et al. (2013) STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29: 15–21. 10.1093/bioinformatics/bts635 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Love MI, Huber W, Anders S (2014) Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15: 550 10.1186/s13059-014-0550-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Benjamini Y, Hochberg Y (1995) Controlling the False Discovery Rate—a Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society Series B-Methodological 57: 289–300. [Google Scholar]
- 64.Cantarel BL, Korf I, Robb SM, Parra G, Ross E, et al. (2008) MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Research 18: 188–196. 10.1101/gr.6743907 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Salamov AA, Solovyev VV (2000) Ab initio gene finding in Drosophila genomic DNA. Genome Research 10: 516–522. 10.1101/gr.10.4.516 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Parra G, Bradnam K, Ning Z, Keane T, Korf I (2009) Assessing the gene space in draft genomes. Nucleic Acids Research 37: 289–297. 10.1093/nar/gkn916 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Nielsen H (2017) Predicting secretory proteins with SignalP. Methods Mol Biology 1611: 59–73. [DOI] [PubMed] [Google Scholar]
- 68.Olshen AB, Venkatraman ES, Lucito R, Wigler M (2004) Circular binary segmentation for the analysis of array-based DNA copy number data. Biostatistics 5: 557–572. 10.1093/biostatistics/kxh008 [DOI] [PubMed] [Google Scholar]
- 69.Lee H, Lee M, Mohammed Ismail W, Rho M, Fox GC, et al. (2016) MGEScan: a Galaxy-based system for identifying retrotransposons in genomes. Bioinformatics 32: 2502–2504. 10.1093/bioinformatics/btw157 [DOI] [PubMed] [Google Scholar]
- 70.Xu Z, Wang H (2007) LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Research 35: W265–268. 10.1093/nar/gkm286 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Ellinghaus D, Kurtz S, Willhoeft U (2008) LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. BMC Bioinformatics 9: 18 10.1186/1471-2105-9-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Gremme G, Steinbiss S, Kurtz S (2013) GenomeTools: a comprehensive software library for efficient processing of structured genome annotations. IEEE/ACM Trans Comput Biol Bioinform 10: 645–656. 10.1109/TCBB.2013.68 [DOI] [PubMed] [Google Scholar]
- 73.Bao W, Kojima KK, Kohany O (2015) Repbase Update, a database of repetitive elements in eukaryotic genomes. Mobile DNA 6: 11 10.1186/s13100-015-0041-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Sequencing raw data are available from Sequence Read Archive: PRJNA355407. Briefly, PacBio WGS sequencing raw data were deposited at SRR6232283 to SRR6232292. Illumina WGS sequencing raw data were deposited at SRR6232156. LIEP data were deposited at SRR6690042 and SRR6690043. The first cultured RNA-Seq reads were deposited at SRR9126640, and the second RNA-Seq reads were deposited at SRR9127597 to SRR9127602. The Genbank accession number for the B71 genome assembly is MQUD00000000. Custom scripts used in the manuscript are available at Github (https://github.com/liu3zhenlab/manuscripts).