Abstract
The draft genome of Xylaria sp. isolate MSU_SB201401, causal agent of taproot decline of soybean in the southern U.S., is presented here. The genome assembly was 56.7 Mb in size with an L50 of 246. A total of 10,880 putative protein-encoding genes were predicted, including 647 genes encoding carbohydrate-active enzymes and 1053 genes encoding secreted proteins. This is the first draft genome of a plant-pathogenic Xylaria sp. associated with soybean. The draft genome of Xylaria sp. isolate MSU_SB201401 will provide an important resource for future experiments to determine the molecular basis of pathogenesis.
Specifications Table
| Subject area | Biology |
| More specific subject area | Plant Pathology, Mycology, Genomics |
| Type of data | Genomic sequence, gene prediction and annotation of Xylaria sp. isolate MSU_SB201401 |
| How data was acquired | Whole genome was sequenced with an Ion Torrent PGM |
| Data format | Draft genome assembly and gene annotation |
| Experimental factors | The fungus was cultured on yeast extract peptone dextrose medium at room temperature, and DNA was extracted with a modified CTAB protocol |
| Experimental features | The genome was assembled with SPAdes version 3.6 and annotated with MAKER version 2.31.6 |
| Data source location | Isolate MSU_SB201401 was collected from diseased soybean roots in Louisiana, USA; genome sequencing was performed at the University of Arkansas, USA |
| Data accessibility | The whole genome shotgun project of Xylaria sp. isolate MSU_SB201401 has been deposited at DDBJ/ENA/GenBank under the accession number NPFG00000000; the version described in this paper is version NPFG01000000 |
Value of the data
-
•
The first draft genome of Xylaria sp. isolate MSU_SB201401, a causal agent of taproot decline on soybean, is presented.
-
•
Mechanisms of pathogenesis in the genus Xylaria are poorly understood.
-
•
The draft genome will accelerate functional genomics research and help to understand the molecular basis of pathogenicity.
1. Data
The genus Xylaria (Xylariales, Xylariaceae) is the largest genus in the family Xylariaceae. The genus is mostly comprised of saprophytes associated with dead organic matter and endophytes commonly associated with crops like soybean and barley [1]. Morphological features of the genus include large stromatal tissue containing multiple perithecia, cylindrical asci, and dark ascospores with a germ slit [2]. Interestingly, few species of Xylaria have been reported as pathogens [3]. Plant-pathogenic species of Xylaria include Xylaria mali, the causal agent of root rot of apple in the southern and midwestern United States [4], and Xylaria arbuscula associated with quick decline of Macadamia trees [5]. Recently, a taxonomically unresolved member of Xylaria was reported as the causal organism for taproot decline of soybean in the southern U.S., especially along the Mississippi River Valley [6]. Initial phylogenetic analyses of the isolate with four nuclear loci placed the isolate within the Xylaria arbuscula species aggregate [6]. Soybean plants affected by taproot decline often exhibit interveinal chlorosis in the leaves of the lower canopy during vegetative growth stages, which increases in severity as plants mature through reproductive stages of growth and can be observed in the upper canopy during advanced reproductive growth stages (>R6). Taproots and lateral roots display a blackened appearance, with stroma of the fungus frequently visible upon examination. The disease causes premature plant death in severe cases [6].
Taproot decline is the first report of a soybean disease caused by a species of Xylaria. However, this was not the first reported observation of this particular fungal genus associated with soybean. Isolation of an unknown species of Xylaria from soybean seed that had originated in Ethiopia was previously reported [7], but that isolate was not confirmed to be a pathogen of soybean. To accelerate taxonomic resolution, determination of molecular mechanisms underlying pathogenesis, functional genomics research, and the development of molecular detection assays for the pathogen, the genome of Xylaria sp. isolate MSU_SB201401 associated with taproot decline was sequenced and assembled. DNA sequencing with the Ion Torrent PGM platform generated 7,099,268 reads with a mean read length of 281 bp and a total of 2 Gb of data. The resulting genome assembly of Xylaria sp. MSU_SB201401 contained 56.9 Mb, which was notably larger than the 42.8 Mb genome of Xylaria hypoxylon (http://genome.jgi.doe.gov/Xylhyp1/Xylhyp1.home.html) and the 40.9 Mb genome of Xylaria sp. JS573 (GenBank accession JWIU00000000.1). The genome assembly of Xylaria sp. MSU_SB201401 consisted of 6629 contigs (4743 longer than 1 kb), with an N50 of 56 kb, and an L50 of 246. The longest contig spanned 423,920 bp. The GC content of the genome was predicted to be 43.3%. Approximately 14.3% of the genome assembly was comprised of repetitive elements. Transposable elements comprised 7.8% of the genome assembly, with the retrotransposon family TY1_Copia being the most prevalent (6.2% of the genome assembly). A total of 10,881 genes were predicted (average length=1694 bp), encoding proteins with an average length of 496 amino acids. BUSCO analyses of the predicted genes showed 94.7% completeness with 0.3% duplication, 4.0% fragmentation, and 1.3% of genes missing. Of the 10,881 predicted proteins, 10,678 (98%) had at least one homologous sequence in the NCBI nr database (e-value<1e-5). Blast2GO attributed a GO term for 9076 (83%) of the proteins, and 6674 (61%) were functionally annotated. A total of 647 genes encoding carbohydrate-active enzymes (CAZymes) were identified, which included 279 glycoside hydrolases (GHs), 91 glycosyltransferases (GTs), 17 polysaccharide lyases (PLs), 109 carbohydrate esterases (CEs), and 137 auxiliary activities (AAs). Additionally, the genome of Xylaria sp. MSU_SB201401 was predicted to contain 1053 genes encoding secreted proteins, which included 314 CAZymes, and 96 proteases.
Although Xylaria spp. commonly exist as wood decomposers or endophytes, there are limited reports of pathogenic species within the genus. Thus, mechanisms underlying pathogenesis among species of Xylaria are largely unknown. In the context of fundamental research, the genome assembly of Xylaria sp. isolate MSU_SB201401 presented herein will advance the discovery of molecular mechanisms underlying taproot decline of soybean and accelerate taxonomic resolution of the causal organism. In an applied context, the draft genome assembly of Xylaria sp. isolate MSU_SB201401 will facilitate the development of rapid detection assays for the pathogen in plant and environmental samples. Together, these efforts will ultimately help provide important management tools for taproot decline of soybean.
2. Experimental design, materials and methods
2.1. Genomic DNA extraction and sequencing
Three mycelial plugs (3–5 mm2 each) from Xylaria sp. isolate MSU_SB201401 grown on V8 juice agar medium were transferred into yeast extract peptone dextrose medium [8] amended with 100 μg ml−1 carbenicillin (Research Products International, Mt. Prospect, IL, USA) and incubated at room temperature with constant shaking at 100 rpm for 5 days. Fungal tissue was collected by centrifugation at 4000 rpm for 5 min and ground with liquid nitrogen. Genomic DNA was extracted as previously described [9]. The quality and quantity of the genomic DNA were assessed by agarose gel electrophoresis and spectrophotometric analysis, respectively. A 400 bp library was prepared from genomic DNA with the NEBNext Fast DNA Fragmentation and Library Preparation Kit (New England Biolabs, Ipswich, NE, USA) following the manufacturer's instructions. Library quality was evaluated with Agilent Tapestation D1000 tape (Agilent Technologies, Santa Clara, CA, USA). Whole genome sequencing was performed at the University of Arkansas, Fayetteville, AR, USA, with an Ion Torrent Personal Genome Machine (PGM) (Thermo Fisher Scientific, Waltham, MA, USA) and a 318 V2 chip kit.
2.2. Genome assembly and annotation
The genome was assembled with SPAdes version 3.6 [10] with default parameters for Ion Torrent reads. Protein-encoding genes were predicted with the MAKER pipeline version 2.31.6 [11], with support from ab initio predictors SNAP version 2006-07-28 [12] and Augustus version 3.0.2 [13]. Proteins from Xylaria hypoxylon OSC 100004 and Hypoxylon sp. CO27-5 (accessed from http://genome.jgi.doe.gov/programs/fungi/index.jsf) and Daldinia sp. EC12 (GenBank accession MDGZ00000000.1) were also used to support gene prediction, by providing protein homology evidence. Gene completeness of the draft assembly was assessed with Benchmarking Universal Single-Copy Orthologs (BUSCO) software version 2 [14] based on the Ascomycota dataset. The predicted protein sequences were used as queries in a local BLAST search (BLAST version 2.2.31) against the NCBI nr database (update 02/2016). For functional annotation, BLAST search results were analyzed with Blast2GO version 4.1 [15]. CAZymes were identified with dbCAN version 5.0 [16]. Proteins predicted to contain a signal peptide by both SignalP version 4.1 [17] and TargetP version 1.1 [18] were categorized as secreted proteins. Proteases were identified by homology (BLAST) searches against the MEROPS database version 11.0 [19] with a maximum e-value of 1e-5.
Acknowledgements
This work was supported by funding from the Arkansas Soybean Promotion Board, Mississippi Soybean Promotion Board, Louisiana Soybean and Grain Research and Promotion Board, and the USDA National Institute of Food and Agriculture, Hatch projects 1008895 and LAB94338.
References
- 1.Chen J., Zhang L.-C., Xing Y.-M., Wang Y.-Q., Xing X.-K., Zhang D.-W., Liang H.-Q., Guo S.-X. Diversity and taxonomy of endophytic Xylariaceous fungi from medicinal plants of Dendrobium (Orchidaceae) PLoS One. 2013:e58268. doi: 10.1371/journal.pone.0058268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Hsieh H.-M., Lin C.-R., Fang M.-J., Rogers J.D., Fournier J., Lechat C., Ju Y.M. Phylogenetic status of Xylaria subgenus Pseudoxylaria among taxa of the subfamily Xylarioideae (Xylariaceae) and phylogeny of the taxa involved in the subfamily. Mol. Phylogenet. Evol. 2010;54:957–969. doi: 10.1016/j.ympev.2009.12.015. [DOI] [PubMed] [Google Scholar]
- 3.Edwards R.L., Jonglaekha N., Kshirsagar A., Maitland D.J., Mekkamol S., Nugent L.K., Phosri C., Rodtong S., Ruchichachorn N., Sangvichien E., Sharples G.P., Sihanonth N., Theinhirun S., Whalley A.J.S., Whalley M.A. The Xylariaceae as phytopathogens. Recent Res. Dev. Plant Sci. 2003;1:1–19. [Google Scholar]
- 4.Rogers J.D. Xylaria acuta, Xylaria cornu-damae, and Xylaria mali in Continental United States. Mycologia. 1984;76:23–33. [Google Scholar]
- 5.Ko W.H., Kunimoto R.K. Quick decline of macadamia trees - association with Xylaria-arbuscula. Plant Pathol. 1991;40:643–644. [Google Scholar]
- 6.Allen T., Bluhm B.H., Conner K., Doyle V., Price T., Sikora E., Singh R., Spurlock T., Tomaso-Peterson M., Wilkerson T. First description of the causal agent of taproot decline of soybean, an emerging disease in the southern United States. Plant Health Prog. 2017;18:35–40. [Google Scholar]
- 7.Mengistu A., Sinchlair J.B. Seedborne microorganisms of Ethiopian-grown soybean and chickpea seeds. Plant Dis. Rep. 1979;63:616–619. [Google Scholar]
- 8.Flaherty J.E., Pirttilä A.M., Bluhm B.H., Woloshuk C.P. PAC1, a pH-Regulatory Gene from Fusarium verticillioides. Appl. Environ. Microbiol. 2003;69:5222–5227. doi: 10.1128/AEM.69.9.5222-5227.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Li S., Ridenour J.B., Kim H., Hirsch R.L., Rupe J.C., Bluhm B.H. Agrobacterium tumefaciens-mediated transformation of the soybean pathogen Phomopsis longicolla. J. Microbiol. Methods. 2013;92:244–245. doi: 10.1016/j.mimet.2012.12.015. [DOI] [PubMed] [Google Scholar]
- 10.Bankevich A., Nurk S., Antipov D., Gurevich A.A., Dvorkin M., Kulikov A.S., Pyshkin A.V. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J. Comp. Biol. 2012;19:455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Holt C., Yandell M. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. BMC Bioinform. 2011;12:491. doi: 10.1186/1471-2105-12-491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Korf I. Gene finding in novel genomes. BMC Bioinform. 2004;5:59. doi: 10.1186/1471-2105-5-59. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Stanke M., Keller O., Gunduz I., Hayes A., Waack S., Morgenstern B. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acid. Res. 2006;34:W435–W439. doi: 10.1093/nar/gkl200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Simão F.A., Waterhouse R.M., Ioannidis P., Kriventseva E.V., Zdobnov E.M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31:3210–3212. doi: 10.1093/bioinformatics/btv351. [DOI] [PubMed] [Google Scholar]
- 15.Conesa A., Götz S., García-Gómez J.M., Terol J., Talón M., Robles M. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005;21:3674–3676. doi: 10.1093/bioinformatics/bti610. [DOI] [PubMed] [Google Scholar]
- 16.Yin Y., Mao X., Yang J., Chen X., Mao F., Xu Y. dbCAN: a web resource for automated carbohydrate-active enzyme annotation. Nucleic Acids Res. 2012;40:W445–W451. doi: 10.1093/nar/gks479. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Petersen T.N., Brunak S., von Heijne G., Nielsen H. SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat. Methods. 2011;8:785–786. doi: 10.1038/nmeth.1701. [DOI] [PubMed] [Google Scholar]
- 18.Emanuelsson O., Nielsen H., Brunak S., Von Heijne G. Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. J. Mol. Biol. 2000;300:1005–1016. doi: 10.1006/jmbi.2000.3903. [DOI] [PubMed] [Google Scholar]
- 19.Rawlings N.D., Barrett A.J., Bateman A. MEROPS: the database of proteolytic enzymes, their substrates and inhibitors. Nucleic Acids Res. 2011;40:D343–D350. doi: 10.1093/nar/gkr987. [DOI] [PMC free article] [PubMed] [Google Scholar]
