We report here a draft genome sequence of an ex-type strain of Aspergillus neoellipticus, NRRL 5109, which was isolated from pus of a case of chronic emphysema. The final assembly consists of 160 scaffolds totaling 27.55 Mbp (G+C content, 49.96%) and 8,858 predicted genes.
ABSTRACT
We report here a draft genome sequence of an ex-type strain of Aspergillus neoellipticus, NRRL 5109, which was isolated from pus of a case of chronic emphysema. The final assembly consists of 160 scaffolds totaling 27.55 Mbp (G+C content, 49.96%) and 8,858 predicted genes.
ANNOUNCEMENT
Aspergillus fumigatus sensu lato is known as a saprophytic fungus and an important opportunistic human pathogen distributed worldwide (1). Toxins produced by A. fumigatus endanger the health of humans and animals, resulting in huge economic losses (2). However, because of the superficial morphological resemblance among different species, expertise in Aspergillus taxonomy is required to recognize the taxa. There is controversy as to whether Aspergillus neoellipticus should be considered conspecific with A. fumigatus or treated as a separate species (3, 4). We attempted to quickly identify isolates of Aspergillus fumigatus and related taxa by designing five specific primers based on the sequences of gene RBPII and test primers on 34 isolates from 20 Aspergillus species and related taxa. The results showed that two pairs of primers, F2625/R3438 and F2255/R3044 (Y. Yu, L. Wang, Y. Wang, Z. Zeng, and Z. Yu, unpublished data), were able to specifically amplify DNA of 11 strains of A. fumigatus and A. neoellipticus but not that of 18 other species of Aspergillus tested. The Aspergillus neoellipticus isolate NRRL 5109T was provided by the State Key Laboratory of Mycology (Institute of Microbiology, Chinese Academy of Sciences, Beijing, China) and had originated from pus from a case of chronic emphysema. Therefore, to gain a deeper understanding of this isolate, including pathogenicity and gene expression between closely related species using comparative genome analysis, especially to ascertain whether it should be regarded a separate species, the genome was sequenced.
The strain was cultured in potato-dextrose agar at 25°C for 5 days, and genomic DNA was recovered from fresh fungal cultures using a cetyltrimethylammonium bromide (CTAB) extraction method. The sequencing library was generated using the NEB Nextera DNA library prep kit (Illumina, Inc., San Diego, CA, USA). Whole-genome sequencing was performed by Génome Québec (Montreal, Canada) using a HiSeq 2000 platform, specifying 125-bp paired-end reads with a 300-bp insertion. Then, 21,147,236 paired-end reads totaling 6.2 Gb were generated. The quality of the reads was assessed with the program FastQC. The result showed consistent read quality that decreased steadily over the length of the reads, so the reads were used directly without trimming for de novo assembly. The genome was de novo assembled with the programs Velvet version 1.2.10 (5), ABySS version 1.9.0 (6), SOAPdenovo version 2.04 plus GapCloser version 1.1.2 (7), MaSuRCA version 3.2.2 (8), and IDBA-UD version 1.1.3 (9), with odd-numbered k-mers between 31 and 91. Assembly quality and gene predictions using AUGUSTUS version 3.2.2 (10) based on gene models from Aspergillus nidulans were assessed by examining the N50 value and using Benchmarking Universal Single-Copy Orthologs (BUSCO) (11). The optimal assembly was obtained using ABySS at a k-mer of 55, with a total length of 27.55 Mb, consisting of 160 scaffolds, with the largest scaffold measuring 2,127,822 bp, and an N50 value of 886,472 bp. BUSCO based on OrthoDB version 9 data sets for Ascomycota identified 1,301 out of 1,315 complete genes in the assembly genome, containing 1,299 complete and single-copy genes. A BLASTP search using the 8,858 predicted genes against RefSeq and the two annotated genomes of A. fumigatus (GenBank accession numbers NC_007194 and DS499594) revealed significant matches (E value, 1E−5) for 8,789, 8,544, and 8,575 proteins, respectively. Compared with the sequences of A. fumigatus, this genome sequence could further help verify the species status of A. neoellipticus.
Data availability.
This whole-genome shotgun project has been deposited at DDBJ/ENA/GenBank under the accession number QDFW00000000. Raw sequence reads (Illumina) have been deposited in the NCBI Sequence Read Archive under the BioProject number PRJNA449781.
ACKNOWLEDGMENTS
This research was supported by National Natural Science Foundation of China (grant 31570022) and the State Key Laboratory of Mycology, Institute of Microbiology, Chinese Academy of Sciences.
REFERENCES
- 1.Latgé JP. 1999. Aspergillus fumigatus and aspergillosis. Clin Microbiol Rev 12:310–350. doi: 10.1128/CMR.12.2.310. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Taylor RTD, Nancy PK. 2009. Pathogenesis of Aspergillus fumigatus in invasive aspergillosis. Clin Microbiol Rev 22:447–465. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Li F, Wang B, Wang L, Cao B. 2014. Phylogenetic analyses on the diversity of Aspergillus fumigatus sensu lato based on five orthologous loci. Mycopathologia 178:163–176. doi: 10.1007/s11046-014-9790-0. [DOI] [PubMed] [Google Scholar]
- 4.Peterson SW. 2008. Phylogenetic analysis of Aspergillus species using DNA sequences from four loci. Mycologia 100:205–226. doi: 10.1080/15572536.2008.11832477. [DOI] [PubMed] [Google Scholar]
- 5.Zerbino DR, Birney E. 2008. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res 18:821–829. doi: 10.1101/gr.074492.107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Simpson JT, Wong K, Jackman SD, Schein JE, Jones SJ, Birol I. 2009. ABySS: a parallel assembler for short read sequence data. Genome Res 19:1117–1123. doi: 10.1101/gr.089532.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Luo R, Liu B, Xie Y, Li Z, Huang W, Yuan J, He G, Chen Y, Pan Q, Liu Y, Tang J, Wu G, Zhang H, Shi Y, Liu Y, Yu C, Wang B, Lu Y, Han C, Cheung DW, Yiu SM, Peng S, Zhu X, Liu G, Liao X, Li Y, Yang H, Wang J, Lam TW, Wang J. 2012. SOAPdenovo2: an empirically improved memory-efficient short read de novo assembler. GigaScience 1:18. doi: 10.1186/2047-217X-1-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Zimin AV, Marçais G, Puiu D, Roberts M, Salzberg SL, Yorke JA. 2013. The MaSuRCA genome assembler. Bioinformatics 29:2669–2677. doi: 10.1093/bioinformatics/btt476. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Peng Y, Leung HCM, Yiu SM, Chin FYL. 2012. IDBA-UD: a de novo assembler for single cell and metagenomic sequencing data with highly uneven depth. Bioinformatics 28:1420–1428. doi: 10.1093/bioinformatics/bts174. [DOI] [PubMed] [Google Scholar]
- 10.Stanke M, Steinkamp R, Waack S, Morgenstern B. 2004. AUGUSTUS: a Web server for gene finding in eukaryotes. Nucleic Acids Res 32:W309–W312. doi: 10.1093/nar/gkh379. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. 2015. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31:3210–3212. doi: 10.1093/bioinformatics/btv351. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
This whole-genome shotgun project has been deposited at DDBJ/ENA/GenBank under the accession number QDFW00000000. Raw sequence reads (Illumina) have been deposited in the NCBI Sequence Read Archive under the BioProject number PRJNA449781.
