Skip to main content
Genomics Data logoLink to Genomics Data
. 2017 Oct 25;14:106–113. doi: 10.1016/j.gdata.2017.10.003

Comparative analysis of genome sequences of the conifer tree pathogen, Heterobasidion annosum s.s.

Jaeyoung Choi a,1, Gir-Won Lee b,2, Ki-Tae Kim c, Jongbum Jeon c, Nicolas Détry a, Hsiao-Che Kuo a, Hui Sun a, Fred O Asiegbu a, Yong-Hwan Lee a,c,d,
PMCID: PMC5654758  PMID: 29085779

Abstract

The causal agent of root and butt rot of conifer trees, Heterobasidion annosum, is widespread in boreal forests and economically responsible for annual loss of approximately 50 million euros to forest industries in Finland alone and much more at European level. In order to further understand the pathobiology of this fungus at the genome level, a Finnish isolate of H. annosum sensu stricto (isolate 03012) was sequenced and analyzed with the genome sequences of 23 white-rot and 13 brown-rot fungi. The draft genome assembly of H. annosum has a size of 31.01 Mb, containing 11,453 predicted genes. Whole genome alignment showed that 84.38% of H. annosum genome sequences were aligned with those of previously sequenced H. irregulare TC 32-1 counterparts. The result is further supported by the protein sequence clustering analysis which revealed that the two genomes share 6719 out of 8647 clusters. When sequencing reads of H. annosum were aligned against the genome sequences of H. irregulare, six single nucleotide polymorphisms were found in every 1 kb, on average. In addition, 98.68% of SNPs were found to be homo-variants, suggesting that the two species have long evolved from different niches. Gene family analysis revealed that most of the white-rot fungi investigated had more gene families involved in lignin degradation or modification, including laccases and peroxidase. Comparative analysis of the two Heterobasidion spp. as well as white-/brown-rot fungi would provide new insights for understanding the pathobiology of the conifer tree pathogen.

Keywords: Heterobasidion annosum, Annosum root rot, Conifer tree pathogen, Whole genome sequencing, Comparative genomics

1. Introduction

Heterobasidion annosum is the causal agent of root and butt rot of conifer trees in the forests of northern Europe and North America, causing annual economic loss of 800 million euros in Europe alone [1], [2]. Heterobasidion species complex comprises the European H. annosum, H. abietinum, H. parviporum, and the North American H. irregulare and H. occidentale. They exhibit differences in geographic distribution, lifestyles, interfertility, and host ranges [3]. Heterobasidion spp. have been studied with regard to ecology, control methods, physiology, and evolution [4], [5], [6]. Due to their socioeconomic and biological importance, Heterobasidion species complex has been one of the most studied forest fungi [3].

Recently H. irregulare TC 32-1 was fully sequenced, which was the first sequenced basidiomycete forest phytopathogen [7]. With the aid of this, genome-wide association study was able to identity single nucleotide polymorphisms (SNPs) associated with fungal virulence [8]. Transcriptomic profiling became more informative by using the gene models from the taxonomically close species, vitalizing gene expression-associated studies [9], [10], [11]. However, the sequenced North American isolate H. irregulare is genetically and physiologically different from the European H. annosum [12], [13]. The more genome sequences from various geographic locations would enable more accurate analysis and capture species-specific characteristics as well.

Here, we report the genome sequences of a Finnish isolate of H. annosum sensu stricto (isolate 03012) to further understand the pathobiology of this fungus at genome level. In order to assess the macroscopic similarity between H. annosum and H. irregulare, genome-wide alignment and proteins clustering were conducted. Gene synteny analysis showed species-specific genes, implying genomic rearrangements after speciation. In addition, SNPs and short insertions/deletions (InDels) were identified to elucidate microscopic differences between the two species. With the genome sequences of 35 wood-rotting fungi obtained from JGI MycoCosm [14], comparative gene family analysis was conducted to review the white- and brown-rot classification, focusing on their capability to degrade plant materials. As a result, white-rot fungi had more gene families involved in lignin degradation or modification, including laccases and peroxidase, than brown-rot fungi. Collectively, the results presented in this study may provide new insights for understanding the pathobiology of this conifer tree pathogen.

2. Results and discussion

2.1. Comparisons between the genome and proteome sequences of H. annosum and H. irregulare

The genome of H. annosum sensu stricto (isolate 03012) was assembled into 31,007,627 bp with GC content of 51.55%, representing 66.5-fold coverage. Subsequent gene prediction showed that H. annosum was predicted to have 11,453 protein-coding genes. A series of genomic analyses were carried out so as to identify similarities and differences between the genome sequences of H. annosum and H. irregulare (Fig. 1). As a result, the genome sequences of H. annosum showed similar genomic characteristics with those of H. irregulare (Table 1). Protein clustering results showed that 6719 out of 8647 clusters were shared between H. annosum and H. irregulare. The 6719 clusters encompassed 10,701 and 12,193 proteins from H. annosum and H. irregulare, respectively, which account for 93.43% and 90.96% of the whole protein sequences. Among the shared clusters, 4804 clusters contained a single protein member from each species. A large number of shared clusters clearly showed their close relatedness with each other. Their close connection was also shown in the whole genome alignment. It showed that 84.38% of H. annosum genome sequences were aligned with H. irregulare counterparts. In addition, the aligned regions showed 96.06% of sequence identity across the whole genome, on average (Fig. S1). Meanwhile, structural variations, including SNPs and short InDels, further confirmed the taxonomic differences between the two species. A total of 188,190 SNPs were found when the sequencing reads of H. annosum were mapped against the genome sequences of H. irregulare. The majority of SNPs were found in genic regions (145,640), including exons and introns and untranscribed regions. In spite of macro level similarity between the two species, approximately six SNPs were found in every 1 kb, implying that there were great extent of genome modifications in micro level. In addition, 98.68% of the whole variations were found to be homo-variant, suggesting that there was long enough time for those variations to be fixed. Effects of SNPs and InDels were also pronounced in genic regions, followed by upstream, downstream and intergenic regions (Fig. 2a). Among the SNPs, transitions were far more frequently found than transversions, showing the ratio of transitions to transversions to be 3.57. When investigated the dinucleotide changes, the ratio of two transitions, AT to GC and GC to AT, accounted for 73.90%, on average. Each of the four transversions only accounted for 5.00 to 10.10% of the corresponding types (Fig. 2b). Though 92.11% of InDels showed their length within 10 bp, longer ones, up to 42 bp, were also detected (Fig. 2c). It might also suggest that the two species have gone through long-standing active mutations. This is in the line with the earlier report that the divergence of H. annosum and H. irregulare occurred around 30 mya [6]. In addition, a genome-wide association study by sequencing 23 haploid H. annosum s.s. in low-depth sequencing showed that there were approximately 33,000 SNPs for all individuals [8]. Considering that it was in low coverage, it is in agreement with our results in a way that there are many SNPs to convince they have diverged a long time ago.

Fig. 1.

Fig. 1

Genome comparisons of the two Heterobasidion spp. (a) Genomic characteristics of H. annosum are displayed, including density of repetitive elements, predicted genes, orthologous genes found in H. irregulare, scaffold representation, and genes involved in degradation of plant materials (aryl-alcohol dehydrogenase in gray, aryl-alcohol oxidase in black, CAZyme predicted by dbCAN in red, laccase in green, peroxidase in blue and quinone reductase in purple). (b) Genomic characteristics of H. irregulare genome and comparisons to H. annosum genome are presented, including SNP density (purple as the lowest and red as the highest), frequency of insertions/deletions (shown in green and red, respectively), density of repetitive elements, predicted genes, orthologous genes found in H. annosum, scaffold representation, and genes involved in degradation of plant materials with the same color codes used in Fig. 1a. The tracks were described from the innermost circle.

Table 1.

Summary of H. annosum 03012 genome in comparison with that of H. irregulare TC 32-1.

H. annosum 03012 H. irregulare TC 32-1
Assembled genome size 31.01 Mb 33.65 Mb
Number of contigs/scaffolds 2415 15
Number of ambiguous bases (Ns) 1574 bp 323 bp
GC contents 51.55% 52.23%
Number of predicted genes 11,453 13,405
Mean gene length (in amino acids) 405.8 aa 377.8 aa
Average number of exons per gene 5.64 5.39

Fig. 2.

Fig. 2

Summary of single nucleotide changes. (a) Distribution of SNPs by genetic positions, (b) ratio of mutational changes for across genetic positions, and (c) length distribution of insertions and deletions.

2.2. H. annosum-specific genes and unique genomic regions in H. irregulare

From the protein clustering result of the two species, we found 735 and 1624 species-specific genes in 616 and 1312 clusters of H. annosum and H. irregulare, respectively. Only 40 of 735 (5.44%) protein sequences, belonging to H. annosum-specific clusters, had protein domains predicted by InterPro scan. It showed huge difference to the fact that 58.38% of whole protein sequences in H. annosum had at least one domain. We hypothesized that species-specific sequences may be more unstructured than the rest of the proteome, since they presumably have emerged in comparatively recent date. In fact, 25.79% and 30.52% of amino acids in the H. annosum and H. irregulare-specific protein sequences were predicted to be global and local disorder, respectively. The rest of protein sequences in H. annosum, however, showed that 15.33% and 18.33% of amino acid residues were predicted to be locally and globally unstructured, respectively. It was also true for H. irregulare-specific proteins. Among the H. irregulare-specific proteins, 32.58% and 39.38% of amino acid residues were predicted as disordered, while 15.81% and 19.08% were marked as disordered in the others. It supports the fact that the proteins coded by species-specific genes tend to be unstructured than those conserved throughout other species [15]. It may address the observation that H. annosum-specific proteins had much less predicted domains compared to the whole proteome.

In order to see distribution of the H. annosum and H. irregulare-specific genes in other species, BLAST search was performed against the 37 rot fungi including the two Heterobasidion spp. A total of 468 and 806 of H. annosum and H. irregulare-specific protein sequences, respectively, did not show any hits in the other rot fungi genomes. In addition, 464 out of the 468 H. annosum sequences did not have any significant hits when searched against the NCBI non-redundant (NR) database, suggesting that orphan genes may encode them. Interestingly, four of the 468 protein sequences only had hits from bacterial species in the BLAST search, of which the hits were found in mainly sphingomonads belonging to the genera Sphingobium and Novosphingobium (Table S1). These bacterial species have been reported to have similar natural wood habitat as H. annosum [16]. Moreover, some isolates of these species are capable of degrading lignin or aromatic compounds like lignin-derived monoaryls and biaryls [17], [18]. Since these genes were not found in H. irregulare, it is speculated that genes encoding these proteins might be acquired from cohabiting sphingomonad bacteria. On the other hand, there were H. irregulare-specific regions retrieved from the dot plot (Fig. S1), which are 10 kb or longer. A total 1.93 Mb covering 35 regions showed that the repetitive sequences were enriched in comparison with the rest of H. irregulare genome sequences (Table S2). These regions encompassed 633 genes, of which 278 genes did not show any significant hits in any species other than H. irregulare from BLASTP searches against the NR database. Furthermore, 190 out of 278 genes were not found in H. annosum (Fig. S2). The results suggest that species-specific evolution is differently occurring in the two species for adapting to their own niches.

2.3. Evolutionary implication of gene synteny between the two Heterobasidion spp.

In order to speculate potential driving force underpinning the speciation of the two Heterobasidion spp., macroscopic and microscopic comparisons were conducted as shown above. The macroscopic comparison showed that the majority of contigs of H. annosum genome were mapped against the scaffolds in H. irregulare. According to SNP/InDel identification result, which is a microscopic level of genome comparison, six SNPs were identified in every 1 kb, on average. These comparisons showed that there might have been prolonged genomic rearrangements for adapting to their own niches. To further compare the two genomes at an intermediate scale, genome-wide gene synteny was investigated as well as non-syntenic regions in between. As a result, a total of 528 sets of syntenic genes was identified, encompassing 6894 genes (Table S3). Many sets of genes had small non-syntenic regions, which could be caused by different gene models predicted in two genome sequences. As a result, insertions/deletions of a few kb could be frequently found. For example, around 6 kb of insertion in H. annosum contained five predicted genes (Fig. 3). Interestingly, three of them had the best hit against the same gene in H. irregulare, a cytochrome P450 protein-encoding gene, implying that the inserted region might be caused by duplication. Another example showed that an insertion harboring four genes was found in H. irregulare (Fig. 4). The synteny analysis showed that there are many subtle differences at small and intermediate scale between the two genomes of Heterobasidion spp. A series of insertions/deletions of a few kb might have contributed to the adaptation for each species at different geographic locations.

Fig. 3.

Fig. 3

An example of non-syntenic regions found in H. annosum. (a) A dot plot presenting the alignment between H. annosum contig98 (64,767 bp) on X-axis and the corresponding region of H. irregulare on Y-axis. (b) Closer outlook of genomic context in H. annosum where an insertion containing the five predicted genes might occurred. (c) Genomic context showing the corresponding region in H. irregulare.

Fig. 4.

Fig. 4

An example of non-syntenic regions found in H. irregulare. (a) A dot plot presenting the alignment between H. irregulare scaffold_08:340,000–413,000 on X-axis and H. annosum contig120 on Y-axis. (b) Overview of genomic context in H. irregulare where an insertion containing the four predicted genes might occurred. (c) Genomic context showing the corresponding region in H. annosum.

2.4. Comparative gene family analysis of 37 rot fungi including Heterobasidion spp.

Many wood-decaying fungi usually have the ability to deconstruct plant materials, including cellulose, hemicellulose and lignin. In particular, white-rot fungi are well known for their ability to delignify wood materials [19]. Enzymes involved in lignin degradation, such as peroxidases and laccases, have been highlighted for their ecological significance and potential applications in industry [20], [21]. Accordingly, the identification of gene families involved in plant material degradation was conducted with 37 genome sequences of rot fungi (Table S4). For the two Heterobasidion spp., the results of identified gene families closely resembled with each other, since their genomes showed high similarity and they cause the same disease. Interestingly, CAZymes, aryl-alcohol oxidases (AAOs), peroxidases and laccases were more frequently found in white-rot fungi than in brown-rot with statistical significance (t-test; P < 0.0078) (Fig. S3). Considering the fact that AAOs, peroxidases and laccases are known to be major players in lignin degradation [22], it might suggest that white-rot fungi would have better performance in lignin degradation.

On the other hand, secretory proteins play important roles in plant-microbe interactions, including colonization of host cells, defense against oxidative stress from plant cells, environmental sensing and acquisition of nutrients [23]. When secretory potential was assessed for the whole protein sequences for each species, white-rot fungi showed higher number of putative secretory proteins than brown-rot (t-test; P = 0.0001). This result may possibly be interpreted that fungi encoding more proteins involved in plant material degradation need higher secretory potential to convey them to the appropriate subcellular locations. In H. annosum, for example, considerable number of laccases, peroxidases, AAOs, and CAZymes were predicted to be secretory (Table S4). However, it has been reported that some basidiomycetes do not show the characteristics of either typical white- or brown-rot [24], [25]. It may underlie the plasticity in fungal life styles and imply that classical white and brown-rot separation has a room for improvement. Availability of more sequenced genomes of wood-decaying fungi may help to elucidate the complexity behind this classification.

3. Materials and methods

3.1. Fungal source and preparation of genomic DNA

The strain used for genome sequencing was Heterobasidion annosum P-type (isolate 03012 provided by Kari Korhonen, METLA, Finland). The strain was grown on malt extract medium. Genomic DNA from H. annosum was prepared by following a published method [26]. Mycelium was grinded in liquid nitrogen, and pulverized and incubated in equal volumes of lysis buffer (0.2 M sodium borate, 30 mM EDTA, 1% SDS, pH 9.0 and phenol) at 60 °C for 5 min. After centrifugation, the supernatant was treated with RNase, and afterwards with an equal volume phenol/chloroform (1:1). Genomic DNA was precipitated with 0.7 volume 2-propanol, washed with 70% ethanol, dried, and resuspended in 100–200 μL of double distilled H2O.

3.2. Genome sequencing, assembly and gene prediction

Whole genome shotgun strategy was used in generation of short reads for H. annosum 03012 genome by using Illumina Genome Analyser IIx platform with the insert size of 500 bp. The short reads were assembled by SOAP package including SOAPdenovo (version 2.04) [27] and GapCloser (version 1.12). The generated contigs were further used in generation of scaffolds by SSPACE (version 2.0) [28]. Gene prediction was performed by using MAKER (version 2.28) [29] pipeline, integrating the results from SNAP (version 2006-07-28) [30], AUGUSTUS (version 2.7) [31] and GeneMark-ES (version 2.3e) [32].

3.3. Preparation of genome sequences of selected rot fungi

Genome sequences of 36 rot fungi were downloaded from MycoCosm at U.S. Department of Energy Joint Genome Institute website [14]. The data set includes 13 brown-rot and 23 white-rot fungi (Table S5). All the genome data was also archived in the standardized data warehouse of Comparative Fungal Genomics Platform 2.0 (CFGP 2.0; http://cfgp.snu.ac.kr/) for further analysis [33].

3.4. Whole genome alignment and detection of structural variations

In order to see genome-wide conservation, contig sequences of H. annosum were aligned against each of the scaffold sequences of H. irregulare by using BLAT [34]. By taking the hits showing ≥ 1 kb matches into consideration, dot plots for each H. irregulare scaffolds were created by using Gepard (version 1.30) [35]. The best hit for each H. annosum contig against the H. irregulare genome was collected for calculation of the net aligned regions between the two genomes. Single nucleotide polymorphisms (SNPs), short insertions and deletions were identified and analyzed by using BWA (version 0.5.9-r16) [36], SAMtools (version 0.1.18) [37] and snpEff (version 3.4i) [38]. The reads were trimmed from 3′ termini if base quality is < 20 in the Phred scale. The SNPs supported by < 4 reads or > 100 reads were discarded for under- and over-representation. The size of upstream and downstream for each transcript were set to be 1 kb each.

3.5. Identification of gene synteny between the Heterobasidion spp.

In order to identify syntenic regions between H. annosum and H. irregulare genomes, candidate orthologous gene pairs were defined by using BLASTP [39]. Protein sequences in H. annosum were used as query and searched against those in H. irregulare, followed by filtering the best hits. Syntenic regions were identified by using DAGchainer [40] with default settings, and the micro-syntenic differences were shown with the selected regions.

3.6. Gene family annotation

Previously published pipelines were used in identification of putative genes encoding plant cell wall-degrading enzymes (PCWDEs), secretory proteins and peroxidases [23], [33], [41], [42]. To capture the maximum extent of PCWDEs, carbohydrate-active enzymes (CAZymes) were identified by using dbCAN [43] with the E-value cutoff of 1e − 5. And an in-house pipeline was used in laccase and laccase-like multi‑copper oxidase prediction (unpublished). Additional genes involved in lignin degradation, including aryl-alcohol dehydrogenase (AAD), aryl-alcohol oxidase (AAO) and quinone reductase (QR), were identified based on protein domain profiles predicted by InterPro scan [44] (Table S6) The domain profiles were retrieved by using the protein sequences annotated as AAD, AAO and QR in UniProt database [45].

3.7. Protein clustering analysis

Whole proteome sequences were subjected to Markov clustering using tribeMCL [46]. All-by-all BLASTP was performed to prepare input data for clustering analysis with the E-value cutoff of 1e − 5 and the inflation factor of 2.0.

3.8. Prediction of unstructured protein regions

Whole proteome sequences of the two Heterobasidion spp. were scanned by using IUPred, a software predicting global and short disorder in proteins sequences [47]. Subsequently amino acid residues showing disorder tendency above 0.5 were considered as disordered. In addition, residues predicted as structured were removed from the list of local and global disorder.

4. Data deposition

The draft genome sequence of Heterobasidion annosum s.s. isolate 03012 was deposited at DDBJ/EMBL/GenBank under accession no. AOSL00000000.

The following are the supplementary data related to this article.

Supplementary material

mmc1.docx (686.9KB, docx)
Table S3

List of the syntenic gene sets between the two genomes of Heterobasidion spp.

mmc2.xlsx (389.2KB, xlsx)

Transparency document

Transparency document.

mmc3.pdf (201.5KB, pdf)

Acknowledgments

Acknowledgements

This work was supported by the National Research Foundation of Korea grants funded by the Ministry of Science, ICT & Future Planning (NRF-2014R1A2A1A10051434, NRF-2015M3A9B8028679) and the Cooperative Research Program for Agricultural Science & Technology Development (PJ01115401), Rural Development Administration, Republic of Korea. KTK and JJ are grateful for a graduate fellowship through the Brain Korea 21 Plus Program. This work was also supported by the Finland Distinguished Professor Programme (FiDiPro #138116) from Academy of Finland to YHL.

Footnotes

The Transparency document associated with this article can be found, in the online version.

References

  • 1.Asiegbu F.O., Adomas A., Stenlid J. Conifer root and butt rot caused by Heterobasidion annosum (Fr.) Bref. s.l. Mol. Plant Pathol. 2005;6:395–409. doi: 10.1111/j.1364-3703.2005.00295.x. [DOI] [PubMed] [Google Scholar]
  • 2.Hodges C.S. Heterobasidion annosum. Biology, ecology, impact and control. Plant Pathol. 1999;48:564–565. [Google Scholar]
  • 3.Garbelotto M., Gonthier P. Biology, epidemiology, and control of Heterobasidion species worldwide. Annu. Rev. Phytopathol. 2013;51:39–59. doi: 10.1146/annurev-phyto-082712-102225. [DOI] [PubMed] [Google Scholar]
  • 4.Lind M., Stenlid J., Olson Å. Heterobasidion annosum s.l. genomics. Adv. Bot. Res. 2014;70:371–396. [Google Scholar]
  • 5.Sillo F., Garbelotto M., Friedman M., Gonthier P. Comparative genomics of sibling fungal pathogenic taxa identifies adaptive evolution without divergence in pathogenicity genes or genomic structure. Genome Biol. Evol. 2015;7:3190–3206. doi: 10.1093/gbe/evv209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Dalman K., Olson A., Stenlid J. Evolutionary history of the conifer root rot fungus Heterobasidion annosum sensu lato. Mol. Ecol. 2010;19:4979–4993. doi: 10.1111/j.1365-294X.2010.04873.x. [DOI] [PubMed] [Google Scholar]
  • 7.Olson A., Aerts A., Asiegbu F., Belbahri L., Bouzid O., Broberg A., Canback B., Coutinho P.M., Cullen D., Dalman K., Deflorio G., van Diepen L.T.A., Dunand C., Duplessis S., Durling M., Gonthier P., Grimwood J., Fossdal C.G., Hansson D., Henrissat B., Hietala A., Himmelstrand K., Hoffmeister D., Hogberg N., James T.Y., Karlsson M., Kohler A., Kues U., Lee Y.H., Lin Y.C., Lind M., Lindquist E., Lombard V., Lucas S., Lunden K., Morin E., Murat C., Park J., Raffaello T., Rouze P., Salamov A., Schmutz J., Solheim H., Stahlberg J., Velez H., de Vries R.P., Wiebenga A., Woodward S., Yakovlev I., Garbelotto M., Martin F., Grigoriev I.V., Stenlid J. Insight into trade-off between wood decay and parasitism from the genome of a fungal forest pathogen. New Phytol. 2012;194:1001–1013. doi: 10.1111/j.1469-8137.2012.04128.x. [DOI] [PubMed] [Google Scholar]
  • 8.Dalman K., Himmelstrand K., Olson A., Lind M., Brandstrom-Durling M., Stenlid J. A genome-wide association study identifies genomic regions for virulence in the non-model organism Heterobasidion annosum s.s. PLoS One. 2013;8 doi: 10.1371/journal.pone.0053525. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Van der Nest M.A., Olson A., Karlsson M., Lind M., Dalman K., Brandstrom-Durling M., Elfstrand M., Wingfield B.D., Stenlid J. Gene expression associated with intersterility in Heterobasidion. Fungal Genet. Biol. 2014;73:104–119. doi: 10.1016/j.fgb.2014.10.008. [DOI] [PubMed] [Google Scholar]
  • 10.Lunden K., Danielsson M., Durling M.B., Ihrmark K., Nemesio Gorriz M., Stenlid J., Asiegbu F.O., Elfstrand M. Transcriptional responses associated with virulence and defence in the interaction between Heterobasidion annosum s.s. and Norway spruce. PLoS One. 2015;10 doi: 10.1371/journal.pone.0131182. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Raffaello T., Chen H., Kohler A., Asiegbu F.O. Transcriptomic profiles of Heterobasidion annosum under abiotic stresses and during saprotrophic growth in bark, sapwood and heartwood. Environ. Microbiol. 2014;16:1654–1667. doi: 10.1111/1462-2920.12321. [DOI] [PubMed] [Google Scholar]
  • 12.T. Niemelä, K. Korhonen, Taxonomy of the genus Heterobasidion, Heterobasidion annosum. Biology, Ecology, Impact and Control. Ed. xWoodward, S., Stenlid, J., Karjalainen, R. & Hüttermann, A., (1998).
  • 13.Otrosina W.J., Garbelotto M. Heterobasidion occidentale sp. nov. and Heterobasidion irregulare nom. nov.: a disposition of North American Heterobasidion biological species. Fungal Biol. 2010;114:16–25. doi: 10.1016/j.mycres.2009.09.001. [DOI] [PubMed] [Google Scholar]
  • 14.Grigoriev I.V., Nikitin R., Haridas S., Kuo A., Ohm R., Otillar R., Riley R., Salamov A., Zhao X., Korzeniewski F., Smirnova T., Nordberg H., Dubchak I., Shabalov I. MycoCosm portal: gearing up for 1000 fungal genomes. Nucleic Acids Res. 2014;42:D699–D704. doi: 10.1093/nar/gkt1183. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Arendsee Z.W., Li L., Wurtele E.S. Coming of age: orphan genes in plants. Trends Plant Sci. 2014;19:698–708. doi: 10.1016/j.tplants.2014.07.003. [DOI] [PubMed] [Google Scholar]
  • 16.Sun H., Terhonen E., Koskinen K., Paulin L., Kasanen R., Asiegbu F.O. The impacts of treatment with biocontrol fungus (Phlebiopsis gigantea) on bacterial diversity in Norway spruce stumps. Biol. Control. 2013;64:238–246. [Google Scholar]
  • 17.Chen Y., Chai L., Tang C., Yang Z., Zheng Y., Shi Y., Zhang H. Kraft lignin biodegradation by Novosphingobium sp. B-7 and analysis of the degradation process. Bioresour. Technol. 2012;123:682–685. doi: 10.1016/j.biortech.2012.07.028. [DOI] [PubMed] [Google Scholar]
  • 18.Masai E., Kamimura N., Kasai D., Oguchi A., Ankai A., Fukui S., Takahashi M., Yashiro I., Sasaki H., Harada T., Nakamura S., Katano Y., Narita-Yamada S., Nakazawa H., Hara H., Katayama Y., Fukuda M., Yamazaki S., Fujita N. Complete genome sequence of Sphingobium sp. strain SYK-6, a degrader of lignin-derived biaryls and monoaryls. J. Bacteriol. 2012;194:534–535. doi: 10.1128/JB.06254-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Gao D., Du L., Yang J., Wu W.M., Liang H. A critical review of the application of white rot fungus to environmental pollution control. Crit. Rev. Biotechnol. 2010;30:70–77. doi: 10.3109/07388550903427272. [DOI] [PubMed] [Google Scholar]
  • 20.Puhse M., Szweda R.T., Ma Y., Jeworrek C., Winter R., Zorn H. Marasmius scorodonius extracellular dimeric peroxidase — exploring its temperature and pressure stability. Biochim. Biophys. Acta. 2009;1794:1091–1098. doi: 10.1016/j.bbapap.2009.03.015. [DOI] [PubMed] [Google Scholar]
  • 21.Torres-Duarte C., Vazquez-Duhalt R. Applications and prospective of peroxidase biocatalysis in the environmental field. In: Torres E., Ayala M., editors. Biocatalysis Based on Heme Peroxidases: Peroxidases as Potential Industrial Biocatalysts. Springer Berlin Heidelberg; Berlin, Heidelberg: 2010. pp. 179–206. [Google Scholar]
  • 22.Hernandez-Ortega A., Ferreira P., Martinez A.T. Fungal aryl-alcohol oxidase: a peroxide-producing flavoenzyme involved in lignin degradation. Appl. Microbiol. Biotechnol. 2012;93:1395–1410. doi: 10.1007/s00253-011-3836-8. [DOI] [PubMed] [Google Scholar]
  • 23.Choi J., Park J., Kim D., Jung K., Kang S., Lee Y.H. Fungal secretome database: integrated platform for annotation of fungal secretomes. BMC Genomics. 2010;11:105. doi: 10.1186/1471-2164-11-105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Floudas D., Held B.W., Riley R., Nagy L.G., Koehler G., Ransdell A.S., Younus H., Chow J., Chiniquy J., Lipzen A., Tritt A., Sun H., Haridas S., LaButti K., Ohm R.A., Kues U., Blanchette R.A., Grigoriev I.V., Minto R.E., Hibbett D.S. Evolution of novel wood decay mechanisms in Agaricales revealed by the genome sequences of Fistulina hepatica and Cylindrobasidium torrendii. Fungal Genet. Biol. 2015;76:78–92. doi: 10.1016/j.fgb.2015.02.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Riley R., Salamov A.A., Brown D.W., Nagy L.G., Floudas D., Held B.W., Levasseur A., Lombard V., Morin E., Otillar R., Lindquist E.A., Sun H., LaButti K.M., Schmutz J., Jabbour D., Luo H., Baker S.E., Pisabarro A.G., Walton J.D., Blanchette R.A., Henrissat B., Martin F., Cullen D., Hibbett D.S., Grigoriev I.V. Extensive sampling of basidiomycete genomes demonstrates inadequacy of the white-rot/brown-rot paradigm for wood decay fungi. Proc. Natl. Acad. Sci. U. S. A. 2014;111:9923–9928. doi: 10.1073/pnas.1400592111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Kuo H.C., Wang T.Y., Chen P.P., Chen R.S., Chen T.Y. Genome sequence of Trichoderma virens FT-333 from tropical marine climate. FEMS Microbiol. Lett. 2015;362 doi: 10.1093/femsle/fnv036. [DOI] [PubMed] [Google Scholar]
  • 27.Luo R., Liu B., Xie Y., Li Z., Huang W., Yuan J., He G., Chen Y., Pan Q., Liu Y., Tang J., Wu G., Zhang H., Shi Y., Liu Y., Yu C., Wang B., Lu Y., Han C., Cheung D.W., Yiu S.M., Peng S., Xiaoqian Z., Liu G., Liao X., Li Y., Yang H., Wang J., Lam T.W., Wang J. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. Gigascience. 2012;1:18. doi: 10.1186/2047-217X-1-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Boetzer M., Henkel C.V., Jansen H.J., Butler D., Pirovano W. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics. 2011;27:578–579. doi: 10.1093/bioinformatics/btq683. [DOI] [PubMed] [Google Scholar]
  • 29.Cantarel B.L., Korf I., Robb S.M.C., Parra G., Ross E., Moore B., Holt C., Alvarado A.S., Yandell M. MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Res. 2008;18:188–196. doi: 10.1101/gr.6743907. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Korf I. Gene finding in novel genomes. BMC Bioinf. 2004;5:59. doi: 10.1186/1471-2105-5-59. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Stanke M., Waack S. Gene prediction with a hidden Markov model and a new intron submodel. Bioinformatics. 2003;19:ii215–ii225. doi: 10.1093/bioinformatics/btg1080. [DOI] [PubMed] [Google Scholar]
  • 32.Ter-Hovhannisyan V., Lomsadze A., Chernoff Y.O., Borodovsky M. Gene prediction in novel fungal genomes using an ab initio algorithm with unsupervised training. Genome Res. 2008;18:1979–1990. doi: 10.1101/gr.081612.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Choi J., Cheong K., Jung K., Jeon J., Lee G.W., Kang S., Kim S., Lee Y.W., Lee Y.H. CFGP 2.0: a versatile web-based platform for supporting comparative and evolutionary genomics of fungi and oomycetes. Nucleic Acids Res. 2013;41:D714–D719. doi: 10.1093/nar/gks1163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Kent W.J. BLAT—the BLAST-like alignment tool. Genome Res. 2002;12:656–664. doi: 10.1101/gr.229202. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Krumsiek J., Arnold R., Rattei T. Gepard: a rapid and sensitive tool for creating dotplots on genome scale. Bioinformatics. 2007;23:1026–1028. doi: 10.1093/bioinformatics/btm039. [DOI] [PubMed] [Google Scholar]
  • 36.Li H., Durbin R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics. 2010;26:589–595. doi: 10.1093/bioinformatics/btp698. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R., Proc G.P.D. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Cingolani P., Platts A., Wang L.L., Coon M., Nguyen T., Wang L., Land S.J., Lu X.Y., Ruden D.M. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w(1118); iso-2; iso-3. Fly. 2012;6:80–92. doi: 10.4161/fly.19695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Camacho C., Coulouris G., Avagyan V., Ma N., Papadopoulos J., Bealer K., Madden T.L. BLAST +: architecture and applications. BMC Bioinf. 2009;10:421. doi: 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Haas B.J., Delcher A.L., Wortman J.R., Salzberg S.L. DAGchainer: a tool for mining segmental genome duplications and synteny. Bioinformatics. 2004;20:3643–3646. doi: 10.1093/bioinformatics/bth397. [DOI] [PubMed] [Google Scholar]
  • 41.Choi J., Detry N., Kim K.T., Asiegbu F.O., Valkonen J.P.T., Lee Y.H. fPoxDB: fungal peroxidase database for comparative genomics. BMC Microbiol. 2014;14:117. doi: 10.1186/1471-2180-14-117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Kim K.T., Jeon J., Choi J., Cheong K., Song H., Choi G., Kang S., Lee Y.H. Kingdom-wide analysis of fungal small secreted proteins (SSPs) reveals their potential role in host association. Front. Plant Sci. 2016;7:186. doi: 10.3389/fpls.2016.00186. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Yin Y.B., Mao X.Z., Yang J.C., Chen X., Mao F.L., Xu Y. dbCAN: a web resource for automated carbohydrate-active enzyme annotation. Nucleic Acids Res. 2012;40:W445–W451. doi: 10.1093/nar/gks479. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Hunter S., Jones P., Mitchell A., Apweiler R., Attwood T.K., Bateman A., Bernard T., Binns D., Bork P., Burge S., de Castro E., Coggill P., Corbett M., Das U., Daugherty L., Duquenne L., Finn R.D., Fraser M., Gough J., Haft D., Hulo N., Kahn D., Kelly E., Letunic I., Lonsdale D., Lopez R., Madera M., Maslen J., McAnulla C., McDowall J., McMenamin C., Mi H., Mutowo-Muellenet P., Mulder N., Natale D., Orengo C., Pesseat S., Punta M., Quinn A.F., Rivoire C., Sangrador-Vegas A., Selengut J.D., Sigrist C.J., Scheremetjew M., Tate J., Thimmajanarthanan M., Thomas P.D., Wu C.H., Yeats C., Yong S.Y. InterPro in 2011: new developments in the family and domain prediction database. Nucleic Acids Res. 2012;40:D306–D312. doi: 10.1093/nar/gkr948. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.UniProt Consortium UniProt: a hub for protein information. Nucleic Acids Res. 2015;43:D204–D212. doi: 10.1093/nar/gku989. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Enright A.J., Van Dongen S., Ouzounis C.A. An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res. 2002;30:1575–1584. doi: 10.1093/nar/30.7.1575. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Dosztanyi Z., Csizmok V., Tompa P., Simon I. IUPred: web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content. Bioinformatics. 2005;21:3433–3434. doi: 10.1093/bioinformatics/bti541. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material

mmc1.docx (686.9KB, docx)
Table S3

List of the syntenic gene sets between the two genomes of Heterobasidion spp.

mmc2.xlsx (389.2KB, xlsx)

Transparency document.

mmc3.pdf (201.5KB, pdf)

Articles from Genomics Data are provided here courtesy of Elsevier

RESOURCES