Abstract
(1) Background: Pyricularia oryzae, the causal agent of rice blast disease, is one of the major rice pathogens. The complex population structure of P. oryzae facilitates the rapid virulence variations, which make the blast disease a serious challenge for global food security. There is a large body of existing genomics research on P. oryzae, however the population structure at the pan-genome level is not clear, and the mechanism of genetic divergence and virulence variations of different sub-populations is also unknown. (2) Methods: Based on the genome data published in the NCBI, we constructed a pan-genome database of P. oryzae, which consisted of 156 strains (117 isolated from rice and 39 isolated from other hosts). (3) Results: The pan-genome contained a total of 24,100 genes (12,005 novel genes absent in the reference genome 70-15), including 16,911 (~70%) core genes (population frequency ≥95%) and 1378 (~5%) strain-specific genes (population frequency ≤5%). Gene presence-absence variation (PAV) based clustering analysis of the population structure of P. oryzae revealed four subgroups (three from rice and one from other hosts). Interestingly, the cloned avirulence genes and conventional secreted proteins (SPs, with signal peptides) were enriched in the high-frequency regions and significantly associated with transposable elements (TEs), while the unconventional SPs (without signal peptides) were enriched in the low-frequency regions and not associated significantly with TEs. This pan-genome will expand the breadth and depth of the rice blast fungus reference genome, and also serve as a new blueprint for scientists to further study the pathogenic mechanism and virulence variation of the rice blast fungus.
Keywords: Pyricularia oryzae, pan-genome, rice blast, population structure, secreted proteins
1. Introduction
The rapid host adaptation of fungal pathogens represents a challenge in disease control [1]. The plant fungal pathogens often consist of diverged lineages colonized on different hosts, although some of these lineages are not strictly specialized and have overlaps in host ranges [2,3]. During the adaptation to different plant hosts, the fungal pathogens overcome the host immunity system either through the loss of genes coding for proteins which will activate the effector triggered immunity (ETI) [3,4], or the gain of genes coding for effectors required for the successful invasion of plant cells [5,6]. By alternating sexual and asexual reproduction, many plant fungal pathogens can overcome host resistance and achieve rapid host adaptation. In sexual reproduction, gene flow among different lineages facilitates the gain or loss of pathogenicity associated genes (PAGs) in the populations. In asexual reproduction, an absence of recombination ensures that those lineages with beneficial host adaptation loci are maintained and accumulated [7,8]. Deciphering the population structure of these pathogens will provide enhanced insights into mechanisms that drive rapid host adaptation, facilitate the timely prediction of disease pandemics, and deploy efficient management strategies.
A pan-genome contains all the gene information across all the strains of a species. In contrast to the comparative genomic analysis with the reference genome, it is advanced in obtaining the lineage-specific genes absent in the reference genome [9]. Genes in the pan-genome could be divided into two groups, the “core” genes and the “accessory” or “dispensable” genes, according to the gene frequency in the population [10]. Additionally, the pan-genome provides enhanced insights into the presence/absence variation (PAV)-based genome-wide association study (PAV-GWAS), establishing a possible association between the genotype and phenotype. To date, pan-genomes have been broadly used for population structure assessment, diversity analysis, and identification of important functional genes in humans [11] and plants [12,13,14,15,16,17,18], and also for the identification of novel pathogenic strains, or genes associated with infection in fungi [9,19,20,21] and bacteria [22].
Pyricularia oryzae (syn. Magnaporthe oryzae) is a complex species with different pathotypes infecting diverse grasses. It is most famous as the agent of the pandemic rice blast disease and the emerging wheat blast disease [23]. Lineages from different pathotypes are specialized, albeit not strictly, and are not pathogenic to other hosts. With the exception of the wheat-infecting strains with the Repeat-Induced Point mutation (RIP) mechanism [3,24], evidence supporting a naturally occurring sexual life cycle is still lacking for most of this species, although sexual reproduction could be observed under laboratory condition [2,25]. However, the rapid variations in avirulence genes (AVR) are common in the rice-infecting populations, which remains a considerable challenge in rice blast disease control strategies as most of the rice cultivars carry a single major resistance gene [26,27,28]. Increasing evidence reveals that transposable elements (TEs) are one of the major forces driving the variation of PAG genes in fungal pathogens, consequently promoting rapid pathogen adaptation to the host [28,29,30,31,32]. However, previous reference-genome-based studies mainly focused on the variation of conserved genes and shifted limited insights towards the population structure dynamics of P. oryzae. Here, we constructed a pan-genome consisting of 156 strains from different pathotypes and revealed a new variation pattern of secreted proteins in P. oryzae.
2. Results
2.1. Constructing the Pan-Genome of P. oryzae
Based on the homologous gene clusters (OrthoFinder method), we constructed a rice blast fungus pan-genome that contains genome sequences from 156 strains (117 isolated from rice and 39 isolated from other hosts) from the NCBI repository (Figure 1A). The pan-genome contains 24,100 genes, nearly 1-fold (99.26%, 12,005 novel genes) more than the reference genome 70-15 (12,095 non-redundant genes). According to the frequency of pan-genes in the population, core genes (shared by ≥95% strains) account for 68.4% (half of which are hard-core genes present in all strains), strain-specific genes (only shared by <5% strains) account for 2.8%, and dispensable genes (frequency between 5% to 95%) account for 28.7% (Figure 1B). Among them, secreted proteins occupied a significant proportion (significant enrichment) of dispensable genes (Figure 1B).
As the gene increasing curve showed in Figure 1A, the first 117 rice-infecting strains have no overt difference, and the curve grows gently, while the curve from wheat-infecting strains and the other hosts increases abruptly, indicating that the genes in strains infecting different hosts except for rice are very different from each other. Gene presence (red)-absence (blue) variation (PAV) analysis results showed that ~70% of the pan-genes are core genes which present in almost all the strains (≥95%)(Figure 1C). This result is consistent with population frequency accounting (Figure 1B).
The gene PAV-based clustering analysis showed that all of tested P. oryzae strains consist of four clades (Figure 1D). 117 of rice-infecting strains clustered into three clades, containing 27, 33, and 57 strains, respectively. A total of 39 of the wheat or other host-infecting strains were clustered as Clade 4 (Figure 1D). This result is consistent with our previous results based on SNPs analysis of the P. oryzae population [33].
In order to facilitate the usage of these huge data generated in this study, we have built a website of the pan-genome Database of Rice Blast Fungus (http://47.107.41.214/, accessed on 22 November 2022), which is an open access resource. The website provides functions including ID or sequence similarity search, data browser such as homologous gene clusters, gene presence and absence polymorphisms, information on the strains used in this study, and download of sequences of the pan-genome.
2.2. Functional Enrichment Analysis of Pan-Genes
Functional enrichment analysis found that different types of pan-genes in P. oryzae performed different function preferences. The strain-specific genes are mainly involved in the individual adaptation and response to the changing environment, like Photosynthesis, Cytochrome P450, and Cellulose synthase (Figure 2A,B), while the core genes preferably regulate basic internal cellular processes, such as protein/ATP/FAD binding and sugar (and other) transporter (Figure 2C,D).
2.3. Functional Enrichment Analysis of Newly Identified Genes in Pan-Genome
Compared with the reference genome 70-15 (12,095 unique genes), the pan-genome of 117 strains from the rice host contains a total of 22,882 genes and revealed 10,787 new genes (~90%) absent in the 70-15 genome (Figure 3A). Further analyses showed a significant enrichment of genes coding secreted proteins in these novel genes which were significantly associated with transposable elements (TE) (Figure 3B). These novel genes from rice-infecting strains are mainly involved in the regulation of alcoholism, Glycolysis/Gluconeogenesis and IL-17 signaling pathway in KEGG annotation, and genes encoding tropomyosin, such as RNase H and Photosynthetic reaction center proteins in Pfam annotation (Figure 3C).
Compared with the pan-genes in rice-infecting strains, strains from other hosts had a total of 23,139 genes, including 1218 new genes (Figure 3A). Likewise, these novel genes from non-rice host strains were also enriched for secreted proteins and significantly associated with TE (Figure 3B). Functionally, these novel genes are enriched in KEGG pathways mainly involved the regulation of oxidation-reduction processes, enoyl-[acyl-carrier-protein] reductase (NADH) activity, and ADP binding, as well as being enriched in Pfam annotations, such as the cytochrome P450 family, Carboxylesterase family, and some conserved hypothetical proteins (Figure 3D).
2.4. Functional Analysis of Population Differentiation Related Genes in the Pan-Genome
Through the analysis of the subgroup-specific genes (SSGs) in the rice host strains, we found that the subgroups Clade 1, Clade 2, and Clade 3 have 195, 165, and 296 SSGs, respectively, which were present only in selected subgroups, but absent in other subgroups (Figure 4A). Gene functional association analysis showed that there is not a significantly association between SSGs and pathogen–host interaction-related genes (PHI), TE and secreted proteins, except that SSGs in Clade 1 have a significant association with TE (6.03 × 10−8) (Figure 4B). The KEGG pathway enrichment analysis showed that these genes were mainly enriched in pathways like lysine biosynthesis, starch and sucrose metabolism, and phenylpropane biosynthesis (Table 1).
Table 1.
Type | Enriched KEGG Pathway | p-Value |
---|---|---|
Clade 1 | Lysine biosynthesis | 1.07 × 10−2 |
Clade 1 | Drug metabolism—other enzymes | 1.07 × 10−2 |
Clade 1 | Sulfur metabolism | 2.12 × 10−2 |
Clade 2 | Starch and sucrose metabolism | 7.33 × 10−3 |
Clade 2 | Glyoxylate and dicarboxylate metabolism | 7.33 × 10−3 |
Clade 2 | Nitrogen metabolism | 7.33 × 10−3 |
Clade 3 | Carbon fixation in photosynthetic organisms | 1.94 × 10−3 |
Clade 3 | Hepatocellular carcinoma | 6.72 × 10−3 |
Clade 3 | Phenylpropanoid biosynthesis | 1.42 × 10−2 |
Meanwhile, for the core genes under selection (CGUS) 117 rice-infecting strains were also selected as the population differentiation candidate genes. According to the values of Fst and Tajima’s D (Fst > 0.25, Tajima’s D < 0), 576 (Clade 1 vs. Clade 2), 620 (Clade 1 vs. Clade 3), and 697 (Clade 2 vs. Clade 3) CGUS were obtained (Figure 4B). All of them were significantly associated with the PHI, but not significantly associated with TE and secreted proteins (Figure 4B). Pfam enrichment analysis showed that these genes mainly enriched in the phospholipase D active site motif, E1-E2 ATPase, and ubiquitin carboxyl-terminal hydrolase (Table 2).
Table 2.
Type | Pfam ID | Pfam Description | p-Value |
---|---|---|---|
clade2 vs. clade3 | PF00614 | Phospholipase D Active site motif | 3.68 × 10−4 |
clade2 vs. clade3 | PF00122 | E1-E2 ATPase | 4.53 × 10−4 |
clade1 vs. clade2 | PF00443 | Ubiquitin carboxyl-terminal hydrolase | 5.54 × 10−4 |
clade1 vs. clade2 | PF00082 | Subtilase family | 6.69 × 10−4 |
clade1 vs. clade3 | PF04082 | Fungal specific transcription factor domain | 1.09 × 10−3 |
clade1 vs. clade3 | PF00082 | Subtilase family | 1.18 × 10−3 |
clade2 vs. clade3 | PF00454 | Phosphatidylinositol 3- and 4-kinase | 1.46 × 10−3 |
clade2 vs. clade3 | PF00702 | haloacid dehalogenase-like hydrolase | 2.16 × 10−3 |
clade1 vs. clade3 | PF06422 | CDR ABC transporter | 2.33 × 10−3 |
clade1 vs. clade3 | PF14510 | ABC-transporter N-terminal | 2.33 × 10−3 |
2.5. Pan-Genomics Reveals a New Variation Pattern of Secreted Proteins in P. oryzae
We calculated the population frequencies of the nine cloned avirulence (AVR) genes in P. oryzae and found that most of the AVR genes belong to the core genes (Figure 5A), instead of what we usually consider variable genes. AVR1-CO39, AVR-Pi54, AvrPi9 and AvrPiz-t are 100% present in all of the collected strains, except for AvrPii and AvrPia, which are present in only a few strains. In order to verify the distribution of effector proteins, we counted the gene frequencies of all predicted secreted proteins in the rice-infecting strains and found a similar trend in the secreted proteins enriched in high-frequency core genes or low-frequency strain-specific genes (Figure 5B). Further analysis found that conventional secreted proteins (with signal peptides) were enriched in high-frequency near-core genes (p = 4.05 × 10−37) and were significantly associated with TE (p = 4.90 × 10−18). Meanwhile, unconventional secreted proteins (without signal peptides) were enriched in the low-frequency strain-specific genes (p = 4.34 × 10−44) but were not significantly associated with TE (Figure 5C).
3. Discussion
In this study, we constructed a pan-genome of P. oryzae using 156 genome assemblies, including 117 rice-infecting strains and 39 strains from other hosts. The pan-genome consists of 24,100 genes, which is about two times of that of the reference genome 70-15. Consistent with previous studies in fungi [9,19,20,34], core genes constitute the major component of the P. oryzae pan-genome (~70%). The newly identified genes enriched in secreted proteins as well as proteins involved in diverse cellular processes. In contrast to the 70-15 reference genome, some of the field isolates contain either supernumerary chromosomes (accessory chromosome or lineage-specific chromosomes), including the wheat blast isolate B71, rice blast isolates Y34 and FR13, and the grass isolate TF05-1 [35,36,37], or lineage-specific regions [3,38,39]. These isolate-specific genetic components often encode unique genes involved in the interaction with their respective host plants, and therefore may largely contribute to the dynamic pan-genome, as well as the host adaption of P. oryzae.
In addition to the other environmental factors [2], host selective stress is the major force driving the specialization of pathogens [25,40]. In this study, the PAV of genes subdivided the isolates into four Clades, which is consistent with the previous result from our lab using whole genome SNP [33]. The gain and loss of effector gene function are essential for the host adaptation of different pathotypes within a species [4,41,42]. For example, the P. oryzae isolate that caused the outbreak of wheat blast disease in Brazil was supposed to be derived from the host jump of a Lolium isolate through the loss of function of the PWT3 gene [3]. We have previously found that all the cloned AVR genes in P. oryzae were absent in the P. penniseti isolate P1609 [43]. However, our results in current study suggested a ubiquitous distribution of the cloned AVR genes in all the isolates investigated, except for AvrPii and AvrPia which are exclusively present in only a few isolates. The discrepancy could be caused by the isolates used in this study.
TEs are often enriched in the fast-evolving compartments of fungal pathogen genomes and are associated with variations in chromosome structure and gene function, such as deletion, insertion, or expression [31,44,45]. Therefore, the localization of PAGs in proximity to TEs therefore constitutes a rapid host adaptive evolution mechanism for the fungal pathogens [30,46,47]. Here, we showed that both the core and accessory secreted proteins in the pan-genome are associated with TE, which is in agreement with our previous study [29]. Importantly, we further compared the distribution of secreted proteins with and without N-terminal signaling peptides (conventional and unconventional SPs). We found that the former was enriched in the high-frequency region and localized proximately to TEs, and the latter was enriched in the low-frequency region and not associated with TEs. We inferred from these results that unconventional SPs without N-terminal signaling peptide might employ distinct variation mechanism compared with conventional SPs with a signal peptide.
Although only those genomes with 95% completeness were used in this study, the quality of this pan-genome was insufficient for scrutinizing additional features, such as the TE insertion around effector genes. Furthermore, more complete genomes derived from long reads sequencing data are necessary to produce deeper insight into the dynamic pan-genome structure and the rapid adaptive evolution of this notorious fungal pathogen.
4. Materials and Methods
4.1. Genomics Data Collection
A total of 199 genome assembies of P. oryzae were collected (Table S1) from the NCBI assembly database (https://www.ncbi.nlm.nih.gov/genome/browse/#!/eukaryotes/62/, accessed on 30 December 2019). Of those, over 50% were submitted by our group (Prof. Zonghua Wang’s lab in Fujian Agriculture and Forestry University). The quality of these genome assemblies was assessed by BUSCO v4.1.2 [48] with core orthologs at sordariomycetes level (n = 3817), and a total of 156 genome assemblies from different strains with BUSCO completeness greater than 95% were selected for pan-genome analysis.
4.2. Gene Prediction
The 156 genome assemblies were repeat masked by RepeatMasker v4.1.0 (http://www.repeatmasker.org/, accessed on 1 June 2021), and then used for genes prediction by the software fgenesh (species = Magnaporthe oryzae) in MolQuest v2.4.5 (http://molquest.com, accessed on 10 January 2021).
4.3. Pan-Genome Construction
All protein sequences of the 156 strains were clustered together using OrthoFinder v2.5.1 [49] to obtain the homologous gene clusters. Those strain-specific unclustered genes and representative genes in each gene clusters were selected to construct a pan-genome.
4.4. Presence/Absence Variations (PAVs) of Pan-Genes
PAVs of pan-genes in the population were determined by sequence similarity search conducted by NCBI-BLAST tools against genome sequence of each strain. According to the gene frequency in the population, genes were divided into core genes (≥95%), strain-specific genes (≤5%) and dispensable genes (5%–95%).
4.5. Clustering Analysis of Population Structure
The R package clusterProfiler [50] was used for population structure analysis, the strains of rice blast fungus were divided into different subgroups based on PAVs.
4.6. Gene Functional Annotation and Enrichment Analysis
KEGG (Kyoto Encyclopedia of Genes and Genomes) was annotated by with KofamKOALA (https://www.genome.jp/tools/kofamkoala/, accessed on 30 June 2021), KOG was annotated by eggNOGMapperV2 (http://eggnog-mapper.embl.de/, accessed on 30 June 2021), Pfam and GO were annotated by InterProScan v5.55-88.0 (https://www.ebi.ac.uk/interpro/, accessed on 30 June 2021), pathogenicity-related genes were annotated by PHI-base (http://www.phi-base.org/, accessed on 30 June 2021). ClusterProfiler [50] was used for gene functional enrichment analysis.
4.7. Population Differentiation Related Genes Analysis
For the core genes present in relevant subgroup, we used MAFFT (https://mafft.cbrc.jp, accessed on 1 August 2021) to perform multiple sequence alignment of the genome sequences corresponding to the homologous gene clusters, and then used the R package PopGenome [51] to calculate Tajima’s D and Fst. Genes with Fst > 0.25 and Tajima’s D < 0 were selected as candidate population differentiation related genes.
For the unique genes of each subgroup, we selected the genes conserved in a certain subgroup but absent in other subgroups as candidates of population differentiation related genes.
Acknowledgments
We are grateful to Norvienyeku Justice at the school of Plant Protection, Hainan University for helping to revise and improve our manuscript’s language and scientific quality.
Supplementary Materials
The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/jof8121238/s1, Table S1: The genome assemblies used in this study.
Author Contributions
Conceptualization, J.B. and Z.W. (Zhe Wang); methodology, J.B. and Z.W. (Zhe Wang); software, Z.W. (Zhe Wang) and J.B.; validation, X.C., J.X. and W.T.; formal analysis, J.B. and Z.W. (Zhe Wang); investigation, J.B. and Z.W. (Zhe Wang); resources, M.C., S.C. and W.T; data curation, J.B., Z.W (Zhe Wang). and S.C.; writing—original draft preparation, J.B., Z.W. (Zhe Wang), H.Z. and M.C.; writing—review and editing, J.B., H.Z. and Z.W. (Zonghua Wang); visualization, Z.W. (Zhe Wang) and S.C.; supervision, M.C., H.Z. and Z.W. (Zonghua Wang); project administration, J.B. and Z.W. (Zonghua Wang); funding acquisition, Z.W. (Zonghua Wang), J.B., H.Z. and M.C. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
All the data reported in this study could be found at the website of the Pan-genome Database of Rice Blast Fungus (http://47.107.41.214/).
Conflicts of Interest
The authors declare no conflict of interest.
Funding Statement
This work was supported by grants from the National Natural Science Foundation of China (NSFC) to ZHW (U1805232), JDB (32270078), HKZ (32172365) and MLC (32001976).
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Heath M.C. A generalized concept of host-parasite specificity. Phytopathology. 1981;71:1121–1123. doi: 10.1094/Phyto-71-1121. [DOI] [Google Scholar]
- 2.Thierry M., Charriat F., Milazzo J., Adreit H., Ravel S., Cros-Arteil S., Borron S., Sella V., Kroj T., Ioos R., et al. Maintenance of divergent lineages of the Rice Blast Fungus Pyricularia oryzae through niche separation, loss of sex and post-mating genetic incompatibilities. PLoS Pathog. 2022;18:e1010687. doi: 10.1371/journal.ppat.1010687. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Inoue Y., Vy T.T.P., Yoshida K., Asano H., Mitsuoka C., Asuke S., Anh V.L., Cumagun C.J.R., Chuma I., Terauchi R., et al. Evolution of the wheat blast fungus through functional losses in a host specificity determinant. Science. 2017;357:80–83. doi: 10.1126/science.aam9654. [DOI] [PubMed] [Google Scholar]
- 4.Sharma R., Mishra B., Runge F., Thines M. Gene loss rather than gene gain is associated with a host jump from monocots to dicots in the Smut Fungus Melanopsichium pennsylvanicum. Genome Biol. Evol. 2014;6:2034–2049. doi: 10.1093/gbe/evu148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.McDonald B.A., Linde C. Pathogen population genetics, evolutionary potential, and durable resistance. Annu. Rev. Phytopathol. 2002;40:349–379. doi: 10.1146/annurev.phyto.40.120501.101443. [DOI] [PubMed] [Google Scholar]
- 6.Ma L.J., van der Does H.C., Borkovich K.A., Coleman J.J., Daboussi M.J., Di Pietro A., Dufresne M., Freitag M., Grabherr M., Henrissat B., et al. Comparative genomics reveals mobile pathogenicity chromosomes in Fusarium. Nature. 2010;464:367–373. doi: 10.1038/nature08850. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Gladieux P., Feurtey A., Hood M.E., Snirc A., Clavel J., Dutech C., Roy M., Giraud T. The population biology of fungal invasions. Mol. Ecol. 2015;24:1969–1986. doi: 10.1111/mec.13028. [DOI] [PubMed] [Google Scholar]
- 8.de Vienne D.M., Giraud T., Gouyon P.H. Lineage Selection and the Maintenance of Sex. PLoS ONE. 2013;8:e66906. doi: 10.1371/journal.pone.0066906. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.McCarthy C.G.P., Fitzpatrick D.A. Pan-genome analyses of model fungal species. Microb. Genom. 2019;5:e000243. doi: 10.1099/mgen.0.000243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Tettelin H., Masignani V., Cieslewicz M.J., Donati C., Medini D., Ward N.L., Angiuoli S.V., Crabtree J., Jones A.L., Durkin A.S., et al. Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: Implications for the microbial “pan-genome”. Proc. Natl. Acad. Sci. USA. 2005;102:13950–13955. doi: 10.1073/pnas.0506758102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Sherman R.M., Salzberg S.L. Pan-genomics in the human genome era. Nat. Rev. Genet. 2020;21:243–254. doi: 10.1038/s41576-020-0210-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Laflamme B., Dillon M.M., Martel A., Almeida R.N.D., Desveaux D., Guttman D.S. The pan-genome effector-triggered immunity landscape of a host-pathogen interaction. Science. 2020;367:763–768. doi: 10.1126/science.aax4079. [DOI] [PubMed] [Google Scholar]
- 13.Qin P., Lu H., Du H., Wang H., Chen W., Chen Z., He Q., Ou S., Zhang H., Li X., et al. Pan-genome analysis of 33 genetically diverse rice accessions reveals hidden genomic variations. Cell. 2021;184:3542–3558.e3516. doi: 10.1016/j.cell.2021.04.046. [DOI] [PubMed] [Google Scholar]
- 14.Wang W., Mauleon R., Hu Z., Chebotarov D., Tai S., Wu Z., Li M., Zheng T., Fuentes R.R., Zhang F., et al. Genomic variation in 3,010 diverse accessions of Asian cultivated rice. Nature. 2018;557:43–49. doi: 10.1038/s41586-018-0063-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Shang L., Li X., He H., Yuan Q., Song Y., Wei Z., Lin H., Hu M., Zhao F., Zhang C., et al. A super pan-genomic landscape of rice. Cell Res. 2022;32:878–896. doi: 10.1038/s41422-022-00685-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Liu Y., Tian Z. Super graph-based pan-genome: Bringing rice functional genomic study into a new dawn. Mol. Plant. 2022;15:1409–1411. doi: 10.1016/j.molp.2022.07.005. [DOI] [PubMed] [Google Scholar]
- 17.Zhao Q., Feng Q., Lu H., Li Y., Wang A., Tian Q., Zhan Q., Lu Y., Zhang L., Huang T., et al. Pan-genome analysis highlights the extent of genomic variation in cultivated and wild rice. Nat. Genet. 2018;50:278–284. doi: 10.1038/s41588-018-0041-z. [DOI] [PubMed] [Google Scholar]
- 18.Gao L., Gonda I., Sun H., Ma Q., Bao K., Tieman D.M., Burzynski-Chang E.A., Fish T.L., Stromberg K.A., Sacks G.L., et al. The tomato pan-genome uncovers new genes and a rare allele regulating fruit flavor. Nat. Genet. 2019;51:1044–1051. doi: 10.1038/s41588-019-0410-2. [DOI] [PubMed] [Google Scholar]
- 19.Barber A.E., Sae-Ong T., Kang K., Seelbinder B., Li J., Walther G., Panagiotou G., Kurzai O. Aspergillus fumigatus pan-genome analysis identifies genetic variants associated with human infection. Nat. Microbiol. 2021;6:1526–1536. doi: 10.1038/s41564-021-00993-x. [DOI] [PubMed] [Google Scholar]
- 20.Horta M.A.C., Steenwyk J.L., Mead M.E., Dos Santos L.H.B., Zhao S., Gibbons J.G., Marcet-Houben M., Gabaldon T., Rokas A., Goldman G.H. Examination of Genome-Wide Ortholog Variation in Clinical and Environmental Isolates of the Fungal Pathogen Aspergillus fumigatus. mBio. 2022;13:e0151922. doi: 10.1128/mbio.01519-22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Kato R.B., Jaiswal A.K., Tiwari S., Barh D., Azevedo V., Góes-Neto A. Chapter 12—Pan-genomics of fungi and its applications. In: Barh D., Soares S., Tiwari S., Azevedo V., editors. Pan-genomics: Applications, Challenges, and Future Prospects. Academic Press; Cambridge, MA, USA: 2020. pp. 251–260. [Google Scholar]
- 22.de Jesus Sousa T., Jaiswal A.K., Hurtado R.E., de Oliveira Tosta S.F., de Castro Soares S., Gomide A.C.P., Alcantara L.C.J., Barh D., Azevedo V., Tiwari S. Chapter 5—Pan-genomics of veterinary pathogens and its applications. In: Barh D., Soares S., Tiwari S., Azevedo V., editors. Pan-Genomics: Applications, Challenges, and Future Prospects. Academic Press; Cambridge, MA, USA: 2020. pp. 101–119. [Google Scholar]
- 23.Valent B. The Impact of Blast Disease: Past, Present, and Future. Methods Mol. Biol. 2021;2356:1–18. doi: 10.1007/978-1-0716-1613-0_1. [DOI] [PubMed] [Google Scholar]
- 24.Ikeda K., Nakayashiki H., Kataoka T., Tamba H., Hashimoto Y., Tosa Y., Mayama S. Repeat-induced point mutation (RIP) in Magnaporthe grisea: Implications for its sexual cycle in the natural field context. Mol. Microbiol. 2002;45:1355–1364. doi: 10.1046/j.1365-2958.2002.03101.x. [DOI] [PubMed] [Google Scholar]
- 25.Zhong Z., Norvienyeku J., Chen M., Bao J., Lin L., Chen L., Lin Y., Wu X., Cai Z., Zhang Q., et al. Directional Selection from Host Plants Is a Major Force Driving Host Specificity in Magnaporthe Species. Sci. Rep. 2016;6:25591. doi: 10.1038/srep25591. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Huang J., Si W., Deng Q., Li P., Yang S. Rapid evolution of avirulence genes in rice blast fungus Magnaporthe oryzae. BMC Genet. 2014;15:45. doi: 10.1186/1471-2156-15-45. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Chuma I., Isobe C., Hotta Y., Ibaragi K., Futamata N., Kusaba M., Yoshida K., Terauchi R., Fujita Y., Nakayashiki H., et al. Multiple translocation of the AVR-Pita effector gene among chromosomes of the rice blast fungus Magnaporthe oryzae and related species. PLoS Pathog. 2011;7:e1002147. doi: 10.1371/journal.ppat.1002147. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Zhang S., Wang L., Wu W., He L., Yang X., Pan Q. Function and evolution of Magnaporthe oryzae avirulence gene AvrPib responding to the rice blast resistance gene Pib. Sci. Rep. 2015;5:11642. doi: 10.1038/srep11642. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Bao J., Chen M., Zhong Z., Tang W., Lin L., Zhang X., Jiang H., Zhang D., Miao C., Tang H., et al. PacBio Sequencing Reveals Transposable Elements as a Key Contributor to Genomic Plasticity and Virulence Variation in Magnaporthe oryzae. Mol. Plant. 2017;10:1465–1468. doi: 10.1016/j.molp.2017.08.008. [DOI] [PubMed] [Google Scholar]
- 30.Dong S., Raffaele S., Kamoun S. The two-speed genomes of filamentous pathogens: Waltz with plants. Curr. Opin. Genet. Dev. 2015;35:57–65. doi: 10.1016/j.gde.2015.09.001. [DOI] [PubMed] [Google Scholar]
- 31.Faino L., Seidl M.F., Shi-Kunne X., Pauper M., van den Berg G.C., Wittenberg A.H., Thomma B.P. Transposons passively and actively contribute to evolution of the two-speed genome of a fungal pathogen. Genome Res. 2016;26:1091–1100. doi: 10.1101/gr.204974.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Rouxel T., Grandaubert J., Hane J.K., Hoede C., van de Wouw A.P., Couloux A., Dominguez V., Anthouard V., Bally P., Bourras S., et al. Effector diversification within compartments of the Leptosphaeria maculans genome affected by Repeat-Induced Point mutations. Nat. Commun. 2011;2:202. doi: 10.1038/ncomms1189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Zhong Z., Chen M., Lin L., Han Y., Bao J., Tang W., Lin L., Lin Y., Somai R., Lu L., et al. Population genomic analysis of the rice blast fungus reveals specific events associated with expansion of three main clades. ISME J. 2018;12:1867–1878. doi: 10.1038/s41396-018-0100-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Plissonneau C., Hartmann F.E., Croll D. Pangenome analyses of the wheat pathogen Zymoseptoria tritici reveal the structural basis of a highly plastic eukaryotic genome. BMC Biol. 2018;16:5. doi: 10.1186/s12915-017-0457-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Langner T., Harant A., Gomez-Luciano L.B., Shrestha R.K., Malmgren A., Latorre S.M., Burbano H.A., Win J., Kamoun S. Genomic rearrangements generate hypervariable mini-chromosomes in host-specific isolates of the blast fungus. PLoS Genet. 2021;17:e1009386. doi: 10.1371/journal.pgen.1009386. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Peng Z., Oliveira-Garcia E., Lin G., Hu Y., Dalby M., Migeon P., Tang H., Farman M., Cook D., White F.F., et al. Effector gene reshuffling involves dispensable mini-chromosomes in the wheat blast fungus. PLoS Genet. 2019;15:e1008272. doi: 10.1371/journal.pgen.1008272. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Xue M., Yang J., Li Z., Hu S., Yao N., Dean R.A., Zhao W., Shen M., Zhang H., Li C., et al. Comparative analysis of the genomes of two field isolates of the rice blast fungus Magnaporthe oryzae. PLoS Genet. 2012;8:e1002869. doi: 10.1371/journal.pgen.1002869. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Yoshida K., Saitoh H., Fujisawa S., Kanzaki H., Matsumura H., Yoshida K., Tosa Y., Chuma I., Takano Y., Win J., et al. Association genetics reveals three novel avirulence genes from the rice blast fungal pathogen Magnaporthe oryzae. Plant Cell. 2009;21:1573–1591. doi: 10.1105/tpc.109.066324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Dong Y., Li Y., Zhao M., Jing M., Liu X., Liu M., Guo X., Zhang X., Chen Y., Liu Y., et al. Global genome and transcriptome analyses of Magnaporthe oryzae epidemic isolate 98-06 uncover novel effectors and pathogenicity-related genes, revealing gene gain and lose dynamics in genome evolution. PLoS Pathog. 2015;11:e1004801. doi: 10.1371/journal.ppat.1004801. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Liao J., Huang H., Meusnier I., Adreit H., Ducasse A., Bonnot F., Pan L., He X., Kroj T., Fournier E., et al. Pathogen effectors and plant immunity determine specialization of the blast fungus to rice subspecies. Elife. 2016;5:e19377. doi: 10.7554/eLife.19377. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Hu Z.J., Huang Y.Y., Lin X.Y., Feng H., Zhou S.X., Xie Y., Liu X.X., Liu C., Zhao R.M., Zhao W.S., et al. Loss and natural variations of blast fungal avirulence genes breakdown rice resistance genes in the Sichuan Basin of China. Front. Plant Sci. 2022;13:788876. doi: 10.3389/fpls.2022.788876. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Fouche S., Plissonneau C., Croll D. The birth and death of effectors in rapidly evolving filamentous pathogen genomes. Curr. Opin. Microbiol. 2018;46:34–42. doi: 10.1016/j.mib.2018.01.020. [DOI] [PubMed] [Google Scholar]
- 43.Zheng H., Zhong Z., Shi M., Zhang L., Lin L., Hong Y., Fang T., Zhu Y., Guo J., Zhang L., et al. Comparative genomic analysis revealed rapid differentiation in the pathogenicity-related gene repertoires between Pyricularia oryzae and Pyricularia penniseti isolated from a Pennisetum grass. BMC Genom. 2018;19:927. doi: 10.1186/s12864-018-5222-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Chan J.E., Kolodner R.D. A genetic and structural study of genome rearrangements mediated by high copy repeat Ty1 elements. PLoS Genet. 2011;7:e1002089. doi: 10.1371/journal.pgen.1002089. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Bourque G., Burns K.H., Gehring M., Gorbunova V., Seluanov A., Hammell M., Imbeault M., Izsvak Z., Levin H.L., Macfarlan T.S., et al. Ten things you should know about transposable elements. Genome Biol. 2018;19:199. doi: 10.1186/s13059-018-1577-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Torres D.E., Thomma B., Seidl M.F. Transposable Elements Contribute to Genome Dynamics and Gene Expression Variation in the Fungal Plant Pathogen Verticillium dahliae. Genome Biol. Evol. 2021;13:evab135. doi: 10.1093/gbe/evab135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Yoshida K., Saunders D.G., Mitsuoka C., Natsume S., Kosugi S., Saitoh H., Inoue Y., Chuma I., Tosa Y., Cano L.M., et al. Host specialization of the blast fungus Magnaporthe oryzae is associated with dynamic gain and loss of genes linked to transposable elements. BMC Genom. 2016;17:370. doi: 10.1186/s12864-016-2690-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Manni M., Berkeley M.R., Seppey M., Simao F.A., Zdobnov E.M. BUSCO Update: Novel and Streamlined Workflows along with Broader and Deeper Phylogenetic Coverage for Scoring of Eukaryotic, Prokaryotic, and Viral Genomes. Mol. Biol. Evol. 2021;38:4647–4654. doi: 10.1093/molbev/msab199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Emms D.M., Kelly S. OrthoFinder: Phylogenetic orthology inference for comparative genomics. Genome Biol. 2019;20:238. doi: 10.1186/s13059-019-1832-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Wu T., Hu E., Xu S., Chen M., Guo P., Dai Z., Feng T., Zhou L., Tang W., Zhan L., et al. clusterProfiler 4.0: A universal enrichment tool for interpreting omics data. Innovation (Camb) 2021;2:100141. doi: 10.1016/j.xinn.2021.100141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Pfeifer B., Wittelsburger U., Ramos-Onsins S.E., Lercher M.J. PopGenome: An efficient Swiss army knife for population genomic analyses in R. Mol. Biol. Evol. 2014;31:1929–1936. doi: 10.1093/molbev/msu136. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All the data reported in this study could be found at the website of the Pan-genome Database of Rice Blast Fungus (http://47.107.41.214/).