Aspergillus tubingensis WU-2223L, belonging to the section Nigri, is a hyperproducer of citric acid. Here, we present the high-quality draft (35 Mb) and mitochondrial (32.4 kb) genome sequences of this strain, which consisted of 16 scaffolds in total. The draft and mitochondrial genome sequences comprised 11,493 and 15 genes, respectively.
ABSTRACT
Aspergillus tubingensis WU-2223L, belonging to the section Nigri, is a hyperproducer of citric acid. Here, we present the high-quality draft (35 Mb) and mitochondrial (32.4 kb) genome sequences of this strain, which consisted of 16 scaffolds in total. The draft and mitochondrial genome sequences comprised 11,493 and 15 genes, respectively.
ANNOUNCEMENT
Citric acid (CA) is industrially produced by fermentation using the filamentous fungus Aspergillus niger (1). Strain WU-2223L, isolated in 1960 from Japanese soil and identified as A. niger (2), was reidentified as A. tubingensis based on recent phylogenetic classification (3). Since strain WU-2223L produces large amounts of CA (>50% [wt/wt] from supplied glucose) (2, 4, 5), the genome sequence of A. tubingensis WU-2223L will enhance our understanding of the mechanisms underlying CA hyperproduction.
A. tubingensis WU-2223L, deposited as NBRC111403 in the National Institute of Technology and Evaluation Biological Resource Center and as IFM 65578 in the Medical Mycology Research Center, Chiba University (through the National BioResource Project [NBRP]), was used in this study. The genome sequence was produced using a hybrid assembly approach of Illumina and Oxford Nanopore Technologies (ONT; Cambridge, UK). The genomic DNA of A. tubingensis WU-2223L was extracted from mycelia cultivated in potato dextrose broth at 25°C for 1 day using phenol-chloroform extraction and a Genomic-tip 20/G (Qiagen, Hilden, Germany). A DNA library for Illumina sequencing was prepared using a NEBNext Ultra II FS DNA library prep kit for Illumina (New England Biolabs, Ipswich, MA). Paired-end (PE) sequencing of 2 × 150-bp reads was performed by BGI (Kobe, Japan) on a HiSeq X platform (Illumina, San Diego, CA). A DNA library for ONT sequencing was prepared using a short-read eliminator kit (Nippon Genetics, Tokyo, Japan) and a ligation sequencing kit (SQK-LSK109; ONT), and sequencing was performed on a MinION platform (ONT) with a FLO-MIN106D flow cell (R9.4.1; ONT) for 48 h according to the manufacturer’s instructions. A total of 474,042 ONT reads (8.7 Gbp) were generated with a mean length of 18,464 bp using ONT Guppy version 2.3.5 and subsequently filtered using Filtlong version 0.2.0 (https://github.com/rrwick/Filtlong) with Illumina PE reads. The subsampled ONT reads were corrected using Canu version 1.8 (6). The parameter settings for software are shown in Table 1.
TABLE 1.
Parameter settings used for genome sequencing and assembly
| Software | Parameters |
|---|---|
| Guppy | Options --qscore_filtering, --flowcell, FLO-MIN106, --kit SQK-LSK109, --enable_trimming, true, and --cpu_threads_per_caller 10 |
| Filtlong | Options --min_length 100, --keep_percent 90, --target_bases 4000000000, --trim, and --split |
| Canu | Options nanopore-raw and genomeSize=37m |
| BWA | Default |
| Pilon | Default |
| MaSuRCA | Options FLYE_ASSEMBLY=1 and LHE_COVERAGE=25 |
The draft genome of A. tubingensis WU-2223L was assembled using MaSuRCA version 3.3.3 (7) with 19,102,449 PE reads (162× coverage) and 32,827 of the corrected ONT reads (38× coverage; 1.3 Gbp) and was iteratively polished four times using Burrows-Wheeler Aligner (BWA) version 0.7.17 (8) and Pilon version 1.23 (9). A total of 16 scaffolds was obtained, and a total draft genome size of 35,047,259 bp (49.32% G+C content) was determined with that of a mitochondrial genome. The N50 value and the maximum scaffold size were 3,153,776 bp and 5,352,872 bp, respectively. A total of 11,493 protein-coding genes were predicted using FunGAP version 1.0.1 (10) with transcriptome reads of A. tubingensis CBS 134.48 (GenBank accession number PRJNA358285) and an Aspergillus oryzae gene model for AUGUSTUS (11) inside FunGAP. Totals of 270 tRNAs and 68 rRNAs were predicted using tRNAscan-SE version 1.3.1 (12) and RNAmmer version 1.2 (13), respectively. For the mitochondrial genome with 32,393 bp, 15 open reading frames (ORFs) and 26 tRNAs were predicted using GeSeq (14) and tRNAscan-SE, respectively.
Data availability.
The whole and mitochondrial genome sequences of A. tubingensis WU-2223L were deposited at DDBJ/EMBL/GenBank under the accession numbers BLWE01000001.1 to BLWE01000015.1 and LC545447.1, respectively. The raw sequencing reads were submitted to the Sequence Read Archive under the accession number DRA010281.
ACKNOWLEDGMENTS
This work was supported by a Waseda University grant for special research projects (2019R-036). Computations were partially performed on the NIG supercomputer at the ROIS National Institute of Genetics.
REFERENCES
- 1.Kirimura K, Yoshioka I. 2019. Bio-based chemicals, citric acid, p 158–165. In Moo-Young M. (ed), Comprehensive biotechnology, 3rd ed, vol 3 Elsevier, London, United Kingdom. [Google Scholar]
- 2.Usami S, Taketomi N. 1960. Citric acid fermentation of saccharified sweet potato solution. Kogyo Kagaku Zasshi 63:1602–1605. (In Japanese.) doi: 10.1246/nikkashi1898.63.9_1602. [DOI] [Google Scholar]
- 3.Hong S-B, Lee M, Kim D-H, Varga J, Frisvad JC, Perrone G, Gomi K, Yamada O, Machida M, Houbraken J, Samson RA. 2013. Aspergillus luchuensis, an industrially important black Aspergillus in east Asia. PLoS One 8:e63769. doi: 10.1371/journal.pone.0063769. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Usami S. 1978. Production of citric acid by submerged culture. Mem Sch Sci Eng 42:17–26. https://ci.nii.ac.jp/naid/10005408580. [Google Scholar]
- 5.Kirimura K, Kobayashi K, Yoshioka I. 2019. Decrease of citric acid produced by Aspergillus niger through disruption of the gene encoding a putative mitochondrial citrate-oxoglutarate shuttle protein. Biosci Biotechnol Biochem 83:1538–1546. doi: 10.1080/09168451.2019.1574205. [DOI] [PubMed] [Google Scholar]
- 6.Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM. 2017. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res 27:722–736. doi: 10.1101/gr.215087.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Zimin AV, Marçais G, Puiu D, Roberts M, Salzberg SL, Yorke JA. 2013. The MaSuRCA genome assembler. Bioinformatics 29:2669–2677. doi: 10.1093/bioinformatics/btt476. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Li H, Durbin R. 2009. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963. doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Min B, Grigoriev IV, Choi IG. 2017. FunGAP: Fungal Genome Annotation Pipeline using evidence-based gene model evaluation. Bioinformatics 33:2936–2937. doi: 10.1093/bioinformatics/btx353. [DOI] [PubMed] [Google Scholar]
- 11.Stanke M, Morgenstern B. 2005. AUGUSTUS: a Web server for gene prediction in eukaryotes that allows user-defined constraints. Nucleic Acids Res 33:W465–W467. doi: 10.1093/nar/gki458. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 25:955–964. doi: 10.1093/nar/25.5.955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Lagesen K, Hallin P, Rødland EA, Staerfeldt HH, Rognes T, Ussery DW. 2007. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res 35:3100–3108. doi: 10.1093/nar/gkm160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Tillich M, Lehwark P, Pellizzer T, Ulbricht-Jones ES, Fischer A, Bock R, Greiner S. 2017. GeSeq: versatile and accurate annotation of organelle genomes. Nucleic Acids Res 45:W6–W11. doi: 10.1093/nar/gkx391. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The whole and mitochondrial genome sequences of A. tubingensis WU-2223L were deposited at DDBJ/EMBL/GenBank under the accession numbers BLWE01000001.1 to BLWE01000015.1 and LC545447.1, respectively. The raw sequencing reads were submitted to the Sequence Read Archive under the accession number DRA010281.
