Abstract
Aspergillus niger is one of the most important sources of secondary metabolites (SMs), with a wide array of pharmacological effects, including anti-inflammatory, antitumor, immunomodulatory and antioxidant effects. However, the biosynthetic analysis of these bioactive components has been rarely reported owing to the lack of high-quality genome sequences and comprehensive analysis. In this study, the whole genome of one marine-sponge-derived strain A. niger L14 was sequenced and assembled as well as in-depth bioinformatic analysis. The results indicated that the sequence assembly of strain L14 generated one high-quality genome with a total size of 36.1 Mb, a G + C content of 45.3% and an N50 scaffold of 4.2 Mb. Gene annotation was extensively deployed using various BLAST databases, including non-redudant (Nr) protein sequence, nucleotide (Nt) sequence, Swiss-Prot, Gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) and Clusters of Orthologous Groups (COG) as well as Pathogen Host Interactions (PHI) and Carbohydrate-active enzymes (CAZy) databases. AntiSMASH analysis revealed that this marine strain harbors a total of 69 SMs biosynthesis gene clusters (BGCs), including 17 PKSs, 18 NRPSs, 21 NRPS-likes, 9 terpenes, 2 indoles, 1 betalactone and 1 siderophore, suggesting its biosynthetic potential to produce a wide variety of SMs. These findings will assist in future investigations on the genetic basis of strain L14 and provide insights into its new bioactive SMs for new drug discovery.
Keywords: Aspergillus niger, symbiotic microorganism, genome sequencing, antiSMASH, natural product, biosynthetic potential
1. Introduction
Marine-derived Aspergillus is appraised as a favorable source for discovering new bioactive secondary metabolites (SMs). Currently, marine-derived Aspergillus has been reported about two hundred SMs, which is one of the most widely studied marine fungi in the genus [1]. It mainly includes structural types, such as flavonoids, alkaloids, phenylpropanoids, terpenes, quinones, cyclic peptides and other compounds. Some of these metabolites have shown anti-inflammatory, anti-tumor, antibacterial, free radical scavenging and other biological activities, which may be promising resources for new therapeutic drugs [2,3,4]. The classical pre-genomics assumed that an individual microbial strain could produce only a limited number of SMs, but the first genome sequences Aspergillus published in the 2000s [5] and the subsequent explosion of genome sequencing data in the 2010s demonstrated that these microorganisms have the genetic capability to produce significant numbers of compounds. These unknown metabolic pathways are likely to encode numerous bioactive molecules. One of the most commonly used tools, antiSMASH (antibiotics and Secondary Metabolites Analysis Shell) has had a major impact on genome-to-genome correlation, query and prediction of natural product synthetic gene clusters [6,7].
It is well known that Aspergillus niger is an excellent producer of valuable metabolites, including organic acids (e.g., citric acid, itaconic acid and tensyuic acid) and various enzymes (e.g., polysaccharide-degrading enzymes, amylolytic enzymes) [8,9,10,11]. In our continuous search of bioactive SMs from marine fungi, one fungal strain L14 was obtained from marine sponge Reniera japonica and its chemical study led to isolation of 14 known SMs (fonsecinone A, isoaurasperone A, aurasperone E, asperpyrone C, asperpyrone D, aurasperone G, nigerone, asperpyrone B, tensyuic acid E, tensyuic acid F, 9-hydroxyhexylitaconic acid-1-methyl ester, asperlone B, pyrophen and 2-benzyl-γ-pyrone), most of which are polyketide derivatives (Figure S1). However, these findings were not exciting, since other type SMs were not obtained. It might be ascribed to that fact that most of SM biosynthetic gene clusters (BGCs) in strain L14 were silent or expressed at low levels under conventional culture conditions. In order to better understand and exploit the biosynthetic potential of SMs in depth for new drug discovery, whole-genome sequencing and bioinformative analysis with antiSMASH annotation of strain L14 were extensively carried out in this work.
2. Materials and Methods
2.1. Strain Source
Strain L14 was isolated from one specimen (MNP-2016) of R. japonica, which was collected in September 2016 from the Xinghaiwan Coast of Dalian City, Liaoning Province, China [12]. One suspension of culture containing its mycelia in potato dextrose broth (PDB) supplemented with glycerol (20% v/v) was stored at −80 °C.
2.2. Phylogenetic Analysis
For phylogenetic analysis, strain L14 was cultivated in PDB medium at 28 °C for 3 days followed by 18S rRNA gene amplicon sequencing. Its 18S rRNA sequence with 1680 bp had been submitted to GenBank in NCBI and an accession number MF093522 was acquired. Forty strains with high similarity in NCBI database were selected to construct the phylogenetic tree using MEGA 7.0.26 (Table S1), and the phylogenetic tree was delineated via a neighbor-joining mode which was constructed using the Tamura3 parameter model with 1000 bootstrap replicates.
2.3. DNA Extraction and Whole Genome Sequencing
Genomic DNA of strain L14 was extracted with Gentra Puregene Yeast/Bact. Kit (Qiagen, Valencia, CA, USA) officially recommended by Nanopore [13]. The next-generation and third-generation sequencing technologies were, respectively, utilized Illumina HiSeq 2500 platform and PacBio Sequel platform. DNA libraries of a certain concentration and volume were added to each individual flow cell and the flow cells were transferred to a Grid ION X5 sequencer (Nanopore, Oxford, UK) for real-time single-molecule sequencing. Through a variety of methods, including Homology, SNAP, Augustus, coding genes, repetitive sequences, non-coding RNA, etc., were predicted to obtain the composition of the sequenced strain genome. Repeat sequence prediction was performed by Proteinmask, repeatmasker and trf. The tRNA region was predicted by tRNAscan software, rRNA was predicted by RNAmmer software, miRNA and snRNA were achieved by comparison with Rfam database.
2.4. Genome Assembly
Unicycler software (https://github.com/rrwick/Unicycler, accessed on 10 September 2020) was used for assembly, high-quality data and debug multiple assembly versions were used to select the optimal assembly results [14]. After sequencing, the quality control of data included raw data and clean data quality control files. Clean data were high-quality data obtained by removing connectors and filtering low-quality data according to the original off-line data. Contaminated or low-quality reads, as well as the proportion of N at one read greater than 5% were removed.
2.5. Gene Annotation
Functional genes in whole genome of strain L14 were, respectively, annotated by Pathway, Nr (NCBI non-redundant protein sequences), COG (Cluster of Orthologous Groups of proteins) and KEGG (Kyoto Encyclopedia of Genes and Genomes), as well as GO (Gene Ontology). Through BLAST, the protein sequence was aligned to protein database Nr, GO, KEGG and COG to obtain the highest similarity with a given unigene of the protein containing the function annotation information [15]. Based on Nr annotation information, GO annotation information was obtained by using Blast2GO software. GO functional classification statistics were performed by using WEGO software [16] and KEGG was used to investigate the complex behavior of genes in biology [17]. According to the KEGG annotation information, the pathway annotation of all unigenes was obtained. COG is a database for orthologous classification of gene products. Each COG protein was assumed to be derived from an ancestor protein [18]. The COG database was constructed based on the coding proteins and phylogenetic relationships of bacteria, algae and eukaryotes with complete genomes. The protein sequence was compared by the COG database, the possible function of the protein was also predicted.
2.6. Additional Annotation
Diamond software was used to align the amino acid sequences of the target species with the CAZy database (Carbohydrate-Active Enzymes Database) [19]. Annotation results were screened to ensure biological significance. Meanwhile, each gene in the database contains nucleic acid and amino acid sequences, as well as a detailed description of the protein function predicted during host infection. BLAST software was used to compare the amino acid sequence of the target species with the PHI (Pathogen Host Interactions Database) database (E-value < 0.00001), annotation results were achieved by combining the gene of the target species with its corresponding function annotation information. Our selection criteria for labeling were retaining the comparison results with the highest scores and the query coverage rate was greater than 80%.
2.7. Secondary Metabolic Gene Cluster Analysis
Based on gbk files, nucleic acid sequence and protein sequence, antiSAMSH 6.0 software was used to find secondary metabolic clusters [20]. NCBI database searched for genomic information of strain L14. Generally, genes involved in biosynthetic enzymes in secondary metabolic pathways are clustered on chromosomes. Secondary metabolic gene clusters were classified into 24 categories. The most common secondary metabolic gene clusters are type I, II and III polyketides synthase (PKS) [21,22] and non-ribosomal peptides synthase (NRPS) [23].
3. Results and Discussion
3.1. Morphology and Classification of Strain L14
The white mycelia of strain L14 grew from a potato dextrose agar (PDA) medium after incubation for 3 d at 28 °C, and gradually became darker and its black spores were produced on the seventh day (Figure 1a). Phylogenetic analysis displayed strain L14 had 100% of the 18S rRNA sequence of the reference strain A. niger CBS 554.65 (Figure 1b), suggesting that this isolate unambiguously belongs to the species of A. niger (Table S1).
Figure 1.
Colony morphology (a) and 18S rRNA-based phylogenetic tree (b) of strain L14.
3.2. Genomic Assembly and Analysis
The genome sequencing of strain L14 yielded a sequence with a length of 36,104,262 bp with a G + C content of 49.3% and N50 of 4,203,503 bp. After base calling and Bcl2Fastq transforming, 42,699,652 clean reads were achieved. By extensive search, the genome size of A. niger strains deposited in NCBI database ranged from 34 Mb to 36 Mb; the GC content is approximate 50% (Table 1). As shown in Figure 2, the genome diagram of strain L14 consists of nine circles. As such, 11,524 coding genes were predicted, accounting for 43.23% of the genome. Protein mask, REPEAT masker and TRF were used to predict repeat size (167,544 bp). Repeat masker showed the repeat size is 474,020 bp, 1.31% in Genome. TRF showed the repeat size is 224,849 bp, 0.62% in Genome. The number of long terminal repeat was 166, occupying 0.21%. The number of long interspersed elements was 89, occupying 0.14%. The number of DNA transposons was 82, occupying 0.09%; there were non-coding RNAs, 80 rRNAs, 34 tRNAs and 296 tRNAs (Table 2).
Table 1.
General genome features of strain L14.
Item | Value |
---|---|
Total length (bp) | 36,104,262 |
Contigs | 69 |
GC content (%) | 49.3 |
N50 (bp) | 4,203,503 |
N90 (bp) | 2,686,881 |
Max-length (bp) | 9,886,207 |
Figure 2.
Genome diagram of strain L14 (from inside to outside: the first circle of blue lines shows in-paralog pairs better hits between each other, the E-value between pairs is lower than 1 × 10−5; the second circle shows GC skew, green is the positive part of GC skew, orange is the negative part of GC skew; the third circle shows the GC content; the fourth circle shows ncRNA; the fifth circle shows the repeat; the sixth circle and the seventh circle show the CDS annotation information, and different colors represent different COG annotation classifications; the sixth circle is the CDS in the negative chain; the seventh circle is the CDS in the positive chain; the eighth circle shows the SMs, where the corresponding gene cluster names are marked; the outermost circles show the various scaffolds).
Table 2.
Statistics of strain L14 assembly and annotation.
Type | Repeat/Copy Number | Repeat Size/Total Length (bp) | Average Length (bp) | In Genome% |
---|---|---|---|---|
DNA | 82 | 33,600 | 409 | 0.0931 |
LINE | 89 | 52,257 | 587 | 0.1447 |
LTR | 166 | 79,092 | 476 | 0.2191 |
RC | 5 | 2595 | 519 | 0.0072 |
miRNA | 0 | 0 | 0 | 0 |
rRNA | 80 | 70,794 | 884 | 0.1961 |
snRNA | 34 | 4130 | 121 | 0.0114 |
tRNA | 296 | 560 | 1 | 0.0016 |
3.3. Functional Gene Annotation
The largest number of functional genes in strain L14 was determined as 11,300 genes/98.06% using the Nr database followed by Nt (11,228 genes/97.43%), Swiss-PROt (8147 genes/70.70%), KEGG (7818 genes/67.84%), COG (6162 genes/53.47%) and GO (7464 genes/64.77%). Among these 11,300 genes, 8829 genes have more than 95% similarity with the gene sequences in the Nr database. The results of species annotation showed that the sequence was more likely to be of A. niger strains CBS 523.88 (Figure 3a). It is noteworthy that strain L14 contains 25 Ortholog clusters with various functions, including defense mechanisms, energy production and conversion (Figure 3b). COG analysis suggests that the largest number of genes (2054) belongs to the family of “general function prediction only” followed by “carbohydrate transport and metabolism” (1111), “transcription” (1097) and “amino acid transport and metabolism” (891). Based on GO assignment, 7464 genes were categorized into 90 functional groups. In terms of biological process, genes were detected to be involved in metabolic processes (4339), cellular processes (4103) and single-organism processes (3596). Meanwhile, the number of genes involved in cellular components were cell (3166), cell part (3151), membrane (2594) and organelle (2432). The molecular function revealed that 4056 genes were involved in catalytic activity (4065), followed by binding (3035) (Figure 3c). KEGG annotation obtained global map (3869), carbohydrate metabolism (2075) and amino acid metabolism (1527), which contained a relatively large number of genes (Figure 3d). These findings suggested the presence of an enriched and varied array of carbohydrates and amino metabolism functions that enable higher energy conversion efficiency. Numerous global map function genes indicated that strain L14 can be a very considerable research object for the study of global regulation.
Figure 3.
E-value, similarity and species distribution of strain L14 annotated by Nr database; (a), cluster of orthologous groups of proteins (COG) (b), gene ontology (GO) (c) and Kyoto encyclopedia of genes and genomes (KEGG) (d).
3.4. Additional Annotation
Six carbohydrate-active enzyme (CAZyme) families were identified from the whole genome of strain L14 and consisted of glycol side hydrolases (GH) with 353 genes, glycosyl transferase (GT) with 187 genes, carbohydrate-binding module (CBMs) with 97 genes, carbohydrate esterase (CE) with 46 genes and polysaccharide lyase (PL) with 10 genes, as well as auxiliary activity (AA) with 88 genes (Figure 4a). Particularly, this marine strain possesses the largest number of GHs, suggesting its robust ability to break down lignocellulose [24]. The pathogen–host interaction (PHI) analysis of strain L14 resulted in identification of 2116 genes classified into nine groups, in which the largest is “unaffected pathogenicity” with 850 genes, followed by “reduced virulence” (785 genes), “loss of pathogenicity” (158 genes), “mixed outcome” (155 genes), “lethal” (91 genes), “increased virulence (hypervirulence)” (32 genes), “effector (plant avirulence determinant)” (25 genes), “chemistry target: sensitivity to chemical” (15 genes) and “chemistry target: resistance to chemical” (5 genes) (Figure 4b). The detailed information for these genes is comprehensively summarized in Table S2.
Figure 4.
CAZy (a) and PHI (b) profiles of strain L14.
3.5. AntiSMASH Analysis
Before secondary metabolite-related BGC analysis, the whole genome sequence of strain L14 was submitted to antiSMASH database and compared with the other ten genomes of A. niger strains deposited in the NCBI database. The results suggested that each of these A. niger strains possess a similar number (at least 59) and type of BGCs (Figure 5a). Particularly, strain L14 has the largest number of BGCs classified into seven types, including PKS, NRPS, NRPS-like, terpene, indole, betalactone and siderophore (Table S3), indicating this marine strain displays a vast biosynthetic potential to produce a wide array of SMs. By sequence similarity analysis of all BGCs from these A. niger strains, as many as 45 SMs were predicted and summarized in the hot map (Figure 5b). All putative SMs from strain L14 were marked as blocks in red, the color depth of which positively depended on the similarity of its corresponding BGC sequence (Table S4). Further BLAST analysis resulted in characterization of seven BGCs with over 93% identity, which was, respectively, responsible for the biosynthesis of compounds TAN-1612, yanuthone D, melanin, azaniger-one A, nidulanin A and aureobasidin A and xenolozoyenone (Figure 6 and Figure 7, Tables S5–S11).
Figure 5.
The number and types of secondary metabolite BGCs (a) and their putative products (b) in strain L14 and other A. niger strains deposited in antiSMASH database.
Figure 6.
Seven identified BGCs with over 93% identity in strain L14 responsible for biosynthesis of TAN-1612 (a), nidulanin A (b), yanuthone D (c), melanin (d), aureobasidin A (e); azanigerone A (f) and xenolozoyenone (g).
Figure 7.
Gene location, identity and coverage of seven most similar BGCs in strain L14.
All regions 4.3, 13.2, 14.2 and 35.2 in strain L14 belong to type I PKS. Most of the functional genes in region 4.3 are similar to those in the TAN-1612 BGC from strain A. niger ATCC 1015 (GenBank: JN257714.1). AdaA are the core enzymes in the Ada gene cluster and had been shown to be responsible for the synthesis and cyclization of the decaketone backbone of TAN-1612 [25]. Region 13.2 displays 100% similarity with the BGC (GenBank: ACJE01000012.1), responsible for biosynthesis of yanuthone D, which is a heteroterpene derivative from polyketide 6-methylsalicylic acid (6-MSA) [26]. Region 14.2 harbors a high similar sequence with the BGC (GenBank: ACJE01000001.1) in strain ATCC 1015 for biosynthesis of melanin, which is a major ubiquitous pigment that plays an important role in photoprotection against UV radiation [27,28,29,30]. However, no hydrolase or reductase were detected in this region but phosphopantetheine-binding enzyme was. In addition, region 35.2 exhibits high similarity with the BGC (GenBank: ACJE01000001.1) responsible for biosynthesis of azanigerone A. This aza gene cluster comprises one smallest PKS domain, transacylase (SAT), C-methyltransferase (CMeT) and Zn (II)2Cys6 zinc finger transcription factor [31].
Region 11.1 is one NRPS-related BGC with a higher degree of similarity with nidulanin A BGC from A. nidulans FGSC A4 (GenBank: BN001308.1). This substance is synergistically synthesized by a supercluster cross, which formed with two NRPS, NlsA located on chromosome VIII and NptA located on chromosome V [32], while region 17.1 has similar ABA1 genes from Aureobasidium pullulans (GenBank: EU886741.1) to encode the synthetic complex of the cyclic peptide antibiotic aureobasidin A [33]. Region 62.2 has a significant BLAST hit with the gene encoding GLNRPS7 in Glarea lozoyensis (GenBank: KM603664.1), which is responsible for the biosynthesis of a non-ribosomal peptide xenolozoyenone. The core gene glpks3-glnrps7 had been shown to be clustered with two transporter genes, one oxidoreductase gene and one transcriptional regulatory gene [34].
4. Conclusions
In this study, a high-quality whole genome of marine-derived strain L14 was obtained using next-generation and third-generation sequencing technologies. Strain L14 was undoubtedly identified as A. niger by morphological and 18S rRNA sequence analyses. Genomic analysis indicated that strain L14 has one 36.1 Mb genome with a G + C content of 49.3%. Gene annotation was extensively deployed using various BLAST databases, including Nr protein sequence, Nt sequence, Swiss-Prot, GO, KEGG and COG as well as PHI and CAZy. AntiSMASH analysis uncovered 69 secondary metabolite BGCs in strain L14 and only 7 of them had high similarity with known gene clusters, suggesting most of these BGCs are unknown and yet to be unveiled. This is confirmed by our previous chemical investigation, since a great number of cryptic BGCs in strain L14 are silent or expressed at low levels under conventional conditions. These findings provide an important basis for further chemical study of strain L14 using the gene mining strategy to activate these cryptic BGCs for the production of novel bioactive SMs.
Supplementary Materials
The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/jof8060591/s1, Figure S1: Fourteen secondary metabolites (1–14) previously isolated from strain L14; Table S1: 40 species selected to build the phylogenetic tree in NCBI database; Table S2: PHI database annotation results; Table S3: Secondary metabolite BGCs in strain L14; Table S4: BGC types of SMs of 11 A. niger strains. Table S5: BGC component of TAN-1612 in strain L14; Table S6: BGC component of nidulanin A in strain L14; Table S7: BGC component of yanuthone D in strain L14; Table S8: BGC component of melanin in strain L14; Table S9: BGC component of aureobasidin A in strain L14; Table S10: BGC component of azanigerone A in strain L14; Table S11: BGC component of xenolozoyenone in strain L14.
Author Contributions
P.W. and S.X. made a draft of this article, searched and collected all references; Y.T. organized experimental data. X.B. isolated and identified strain L14; H.W. helped in critically assessing this manuscript; and H.Z. conceived and revised this article. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Conflicts of Interest
All authors declare no conflict of interest.
Funding Statement
This research was funded by the National Key Research and Development Program of China, grant number 2018YFC0311004, by the National Natural Science Foundation of China grant number 41776139 and by the Fundamental Research Fund for the Provincial Universities of Zhejiang, grant number RF-C2019002.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Wang K., Ding P. New bioactive metabolites from the marine-derived fungi Aspergillus. Mini-Rev. Med. Chem. 2018;18:1072–1094. doi: 10.2174/1389557518666180305160856. [DOI] [PubMed] [Google Scholar]
- 2.Lee Y., Kim M., Li H., Zhang P., Bao B.Q., Lee K., Jung J. Marine-derived Aspergillus species as a source of bioactive secondary metabolites. Mar. Biotechnol. 2013;15:499–519. doi: 10.1007/s10126-013-9506-3. [DOI] [PubMed] [Google Scholar]
- 3.Kamiya K., Arai M., Setiawan A., Kobayashi M. Anti-dormant mycobacterial activity of viomellein and xanthomegnin, naphthoquinone dimers produced by marine-derived Aspergillus sp. Nat. Prod. Commun. 2017;12:579–581. doi: 10.1177/1934578X1701200428. [DOI] [PubMed] [Google Scholar]
- 4.Huang R., Han X., Liu J., Zhu Q., Zhang Y., Wang D., Zhang C., Zhao D. Research progress on secondary metabolites with anti-phytopathogenic activities of marine-derived Aspergillus sp.and Trichoderma sp. J. Agric. Sci. 2020;36:1332–1341. [Google Scholar]
- 5.Machida M. Genome sequencing of Aspergillus oryzae. Tanpakushitsu Kakusan Koso. 2006;51:452–456. [PubMed] [Google Scholar]
- 6.Medema M., Blin K., Cimermancic P., de Jager V., Zakrzewski P., Fischbach M., Weber T., Takano E., Breitling R. AntiSMASH: Rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences. Nucleic Acids Res. 2011;39:W339–W346. doi: 10.1093/nar/gkr466. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Blin K., Medema M., Kottmann R., Lee S., Weber T. The antiSMASH database, a comprehensive database of microbial secondary metabolite biosynthetic gene clusters. Nucleic Acids Res. 2017;45:D555–D559. doi: 10.1093/nar/gkw960. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Alekseev K.V., Dubina M.V., Komov V.P. Metabolic characteristics of citric acid synthesis by the fungus Aspergillus niger. Appl. Biochem. Microbiol. 2015;51:857–865. doi: 10.1134/S0003683815090021. [DOI] [Google Scholar]
- 9.Zhao M.L., Lu X.Y., Zong H., Li J.Y., Zhuge B. Itaconic acid production in microorganisms. Biotechnol. Lett. 2018;40:455–464. doi: 10.1007/s10529-017-2500-5. [DOI] [PubMed] [Google Scholar]
- 10.Hasegawa Y., Fukuda T., Hagimori K., Tomoda H., Omura S. Tensyuic acids, new antibiotics produced by Aspergillus niger FKI-2342. Chem. Pharm. Bull. 2007;55:1338–1341. doi: 10.1248/cpb.55.1338. [DOI] [PubMed] [Google Scholar]
- 11.Kato M. An overview of the CCAAT-box binding factor in filamentous fungi: Assembly, nuclear translocation, and transcriptional enhancement. Biosci. Biotechnol. Biochem. 2005;69:663–672. doi: 10.1271/bbb.69.663. [DOI] [PubMed] [Google Scholar]
- 12.Bai X., Dong M., Lai T., Zhang H. Antimicrobial evaluation of the crude extract of symbiotic fungi from marine sponge Reniera japonica. Bangladesh J. Pharmacol. 2018;13:35–40. doi: 10.3329/bjp.v13i1.34468. [DOI] [Google Scholar]
- 13.Leggett R., Clark M. A world of opportunities with nanopore sequencing. J. Exp. Bot. 2017;68:5419–5429. doi: 10.1093/jxb/erx289. [DOI] [PubMed] [Google Scholar]
- 14.Wick R., Judd L., Gorrie C., Holt K. Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput. Biol. 2017;13:22. doi: 10.1371/journal.pcbi.1005595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Wang J., Mu O. Soap-HT-BLAST: High throughput BLAST based on web services. Bioinformatics. 2003;19:1863–1864. doi: 10.1093/bioinformatics/btg244. [DOI] [PubMed] [Google Scholar]
- 16.Thomas P.D., Mi H.Y., Lewis S. Ontology annotation: Mapping genomic regions to biological function. Curr. Opin. Chem. Biol. 2007;11:4–11. doi: 10.1016/j.cbpa.2006.11.039. [DOI] [PubMed] [Google Scholar]
- 17.Li X., Liu Z., Li J., Fang H. Recent progress and application of KEGG database in the research of bioinformatics. Pharm. Biotechnol. 2012;19:535–539. [Google Scholar]
- 18.Oka T., Ungar D., Hughson F., Krieger M. The COG and COPI complexes interact to control the abundance of GEARs, a subset of Golgi integral membrane proteins. Mol. Biol. Cell. 2004;15:2423–2435. doi: 10.1091/mbc.e03-09-0699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Wang S., Chen G., Zhang H., Wang L. Carbohydrate-active enzyme (CAZy) database and its new prospect. Chin. J. Bioprocess Eng. 2014;12:102–108. [Google Scholar]
- 20.Blin K., Wolf T., Chevrette M.G., Lu X.W., Schwalen C.J., Kautsar S.A., Duran H.G.S., Santos E., Kim H.U., Nave M., et al. AntiSMASH 4.0-improvements in chemistry prediction and gene cluster boundary identification. Nucleic Acids Res. 2017;45:W36–W41. doi: 10.1093/nar/gkx319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Robbins T., Liu Y.C., Cane D.E., Khosla C. Structure and mechanism of assembly line polyketide synthases. Curr. Opin. Struct. Biol. 2016;41:10–18. doi: 10.1016/j.sbi.2016.05.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Zhang Z., Pan H., Tang G. New insights into bacterial type II polyketide biosynthesis. F1000Research. 2017;6:172. doi: 10.12688/f1000research.10466.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Payne J., Schoppet M., Hansen M., Cryle M. Diversity of nature’s assembly lines—Recent discoveries in non-ribosomal peptide synthesis. Mol. Biosyst. 2017;13:9–22. doi: 10.1039/C6MB00675B. [DOI] [PubMed] [Google Scholar]
- 24.Zhao Z., Liu H., Wang C., Xu J. Comparative analysis of fungal genomes reveals different plant cell wall degrading capacity in fungi. BMC Genom. 2013;14:15. doi: 10.1186/1471-2164-14-274. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Li Y., Chooi Y., Sheng Y., Valentine J., Tang Y. Comparative characterization of fungal anthracenone and naphthacenedione biosynthetic pathways reveals an alpha-hydroxylation-dependent claisen-like cyclization catalyzed by a dimanganese thioesterase. J. Am. Chem. Soc. 2011;133:15773–15785. doi: 10.1021/ja206906d. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Holm D., Petersen L., Klitgaard A., Knudsen P., Jarczynska Z., Nielsen K., Gotfredsen C., Larsen T., Mortensen U. Molecular and chemical characterization of the biosynthesis of the 6-MSA-drived meroterpenoid yanuthone D in Aspergillus niger. Chem. Biol. 2014;21:519–529. doi: 10.1016/j.chembiol.2014.01.013. [DOI] [PubMed] [Google Scholar]
- 27.Kihara J., Moriwaki A., Tanaka N., Tanaka C., Ueno M., Arase S. Characterization of the BMR1 gene encoding a transcription factor for melanin biosynthesis genes in the phytopathogenic fungus Bipolaris oryzae. FEMS Microbiol. Lett. 2008;281:221–227. doi: 10.1111/j.1574-6968.2008.01101.x. [DOI] [PubMed] [Google Scholar]
- 28.Kihara J., Moriwaki A., Ueno M., Tokunaga T., Arase S., Honda Y. Cloning, functional analysis and expression of a scytalone dehydratase gene (SCD1) involved in melanin biosynthesis of the phytopathogenic fungus Bipolaris oryzae. Curr. Genet. 2004;45:197–204. doi: 10.1007/s00294-003-0477-1. [DOI] [PubMed] [Google Scholar]
- 29.Kihara J., Moriwaki A., Ito M., Arase S., Honda Y. Expression of THR1, a 1,3,8-trihydroxynaphthalene reductase gene involved in melanin biosynthesis in the phytopathogenic fungus Bipolaris oryzae, is enhanced by near-ultraviolet radiation. Pigm. Cell. Res. 2004;17:15–23. doi: 10.1046/j.1600-0749.2003.00102.x. [DOI] [PubMed] [Google Scholar]
- 30.Moriwaki A., Kihara J., Kobayashi T., Tokunaga T., Arase S., Honda Y. Insertional mutagenesis and characterization of a polyketide synthase gene (PKS1) required for melanin biosynthesis in Bipolaris oryzae. FEMS Microbiol. Lett. 2004;238:1–8. doi: 10.1016/j.femsle.2004.07.007. [DOI] [PubMed] [Google Scholar]
- 31.Zabala A., Xu W., Chooi Y., Tang Y. Characterization of a silent azaphilone gene cluster from Aspergillus niger ATCC 1015 reveals a hydroxylation-mediated pyran-ring formation. Chem. Biol. 2012;19:1049–1059. doi: 10.1016/j.chembiol.2012.07.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Andersen M., Nielsen J.B., Klitgaard A., Petersen L., Zachariasen M., Hansen T.J., Blicher L.H., Gotfredsen C., Larsen T., Nielsen K., et al. Accurate prediction of secondary metabolite gene clusters in filamentous fungi. Proc. Natl. Acad. Sci. USA. 2013;110:E99–E107. doi: 10.1073/pnas.1205532110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Slightom J., Metzger B., Luu H., Elhammer A. Cloning and molecular characterization of the gene encoding the aureobasidin A biosynthesis complex in Aureobasidium pullulans BP-1938. Gene. 2009;431:67–79. doi: 10.1016/j.gene.2008.11.011. [DOI] [PubMed] [Google Scholar]
- 34.Yue Q., Chen L., Li Y., Bills G., Zhang X., Xiang M., Li S.J., Che Y., Wang C., Niu X., et al. Functional operons in secondary metabolic gene clusters in glarea lozoyensis (fungi, mscomycota, leotiomycetes) mBio. 2015;6:10. doi: 10.1128/mBio.00703-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Not applicable.